Browsing tag

Anthropic AI Model

AI & Robotics News

Anthropic study: Leading AI models show up to 96% blackmail rate against executives

June 23, 2025

Researchers at Anthropic have uncovered a disturbing pattern of behavior in artificial intelligence systems: models from every major provider—including OpenAI, Google, Meta, and others — demonstrated a willingness to actively sabotage their employers when their goals or existence were threatened. The research, released today, tested 16 leading AI models in simulated corporate environments…

AI & Robotics News

The Interpretable AI playbook: What Anthropic’s research means for your enterprise LLM strategy

June 18, 2025

Anthropic CEO Dario Amodei made an urgent push in April for the need to understand how AI models think. This comes at a crucial time. As Anthropic battles in global AI rankings, it’s important to note what sets it apart from other top AI labs. Since its founding in 2021…

AI & Robotics News

Anthropic debuts Claude conversational voice mode on mobile that searches your Google Docs, Drive, Calendar

May 28, 2025

San Francisco AI startup Anthropic has more up its sleeve than the new Claude Opus 4 and Sonnet 4 large language models (LLMs) announced last week — today it has unveiled two major updates for its similarly named Claude AI chatbot: a new conversational voice mode available…

AI & Robotics News

Anthropic just analyzed 700,000 Claude conversations — and found its AI has a moral code of its own

April 22, 2025

Anthropic, the AI company founded by former OpenAI employees, has pulled back the curtain on an unprecedented analysis of how its AI assistant Claude expresses values during actual conversations with users. The research, released today, reveals both reassuring alignment with the company’s goals and concerning edge cases that could help identify vulnerabilities in AI safety measures. The study…