AI & Robotics News

When your LLM calls the cops: Claude 4’s whistle-blow and the new agentic AI risk stack

The recent uproar surrounding Anthropic’s Claude 4 Opus model – specifically, its tested ability to proactively notify authorities and the media if it suspected nefarious user activity – is sending a cautionary ripple through the enterprise AI landscape. While Anthropic clarified this behavior emerged under specific test conditions, the incident has raised questions for technical…

Not everything needs an LLM: A framework for evaluating when AI makes sense

Question: What product should use machine learning (ML)? Project manager answer: Yes. Jokes aside, the advent of generative AI has upended our understanding of what use cases lend themselves best to ML. Historically, we have always leveraged ML for repeatable, predictive patterns in customer experiences, but now, it’s possible to leverage a form of ML even without an entire training…

Former DeepSeeker and collaborators release new method for training reliable AI agents: RAGEN

2025 was, by many expert accounts, supposed to be the year of AI agents — task-specific AI implementations powered by leading large language and multimodal models (LLMs) like the kinds offered by OpenAI, Anthropic, Google, and DeepSeek. But so far, most AI agents remain stuck as experimental pilots in a kind of corporate purgatory, according to a recent poll conducted by VentureBeat on the…

New open source AI company Deep Cogito releases first models and they’re already topping the charts

Deep Cogito, a new AI research startup based in San Francisco, officially emerged from stealth today with Cogito v1, a new line of open source large language models (LLMs) fine-tuned from Meta’s Llama 3.2 and equipped with hybrid reasoning capabilities — the ability to answer quickly and immediately, or “self-reflect” like OpenAI’s “o” series and DeepSeek R1. The company aims to push…