In a paper, Anthropic researchers said they developed auditing agents that achieved “impressive performance at auditing tasks, while also shedding light on their limitations.” The researchers stated that these agents, created during the pre-deployment testing of Claude Opus 4, enhanced alignment validation tests and enabled researchers to conduct multiple parallel audits at scale. Anthropic…
A new startup founded by an early Anthropic hire has raised $15 million to solve one of the most pressing challenges facing enterprises today: how to deploy artificial intelligence systems without risking catastrophic failures that could damage their businesses.
The…
SecurityPal combines AI and experts in Nepal to speed enterprise security questionnaires by 87X or more
July 24, 2025
When a tech vendor wants to sell into a large enterprise — or when that enterprise wants to buy software from a tech vendor or AI model provider — each side may be required by the other to prove they will handle shared data responsibly in the form of mandatory surveys…
One of the fastest-growing segments of the business market faces a technology paradox. Companies in this segment have outgrown small business tools but sometimes remain too small for many types of traditional enterprise solutions. This creates a unique AI deployment challenge. How do you deliver intelligent automation across fragmented, multi-entity business structures without requiring expensive platform…
Anthropic researchers discover the weird AI problem: Why thinking longer makes models dumber
July 23, 2025
Artificial intelligence models that spend more time “thinking” through problems don’t always perform better — and in some cases, they get significantly worse, according to new research from Anthropic that challenges a core assumption driving the AI industry’s…
A ChatGPT ‘router’ that automatically selects the right OpenAI model for your job appears imminent
July 22, 2025
In the 2.5 years since OpenAI debuted ChatGPT, the number of large language models (LLMs) that the company has made available as options to power its hit chatbot has steadily grown.
In fact, there are now a total of 7 (!!!) different AI models that paying ChatGPT subscribers…
Chinese startup Manus challenges ChatGPT in data visualization: which should enterprises use?
July 22, 2025
The promise sounds almost too good to be true: drop a messy comma-separated values (CSV) file into an AI agent, wait two minutes, and get back a polished, interactive chart ready for your next board presentation.
But that’s exactly what Chinese startup Manus.im is delivering with its latest data visualization feature, launched this month.
Unfortunately, my initial hands-on testing with corrupted…
Google DeepMind makes AI history with gold medal win at world’s toughest math competition
July 22, 2025
Google DeepMind announced Monday that an advanced version of its Gemini artificial intelligence model has officially achieved gold medal-level performance at the International Mathematical Olympiad, solving five of six exceptionally difficult problems and earning recognition…
Remaining Windsurf team and tech acquired by Cognition, makers of Devin: ‘We’re friends with Anthropic again’
July 21, 2025
Autonomous AI coding startup Cognition has signed a definitive agreement to acquire Windsurf, the AI developer tools startup best known for its agentic integrated development environment (IDE). The two companies made the announcement on their respective X accounts on Monday.
Anthropic launches finance-specific Claude with built-in data connectors, higher limits and prompt libraries
July 21, 2025
As some regulated enterprises cautiously expand their use of AI, platform and model makers are starting to offer bespoke versions to specific industries.
Anthropic is taking its first step in that direction with the new Claude for Financial Services, essentially a special version of its Claude for Enterprise tier that could soothe some of the sector's fears around interoperability and tool…