AI & RoboticsNews

Nous Research drops Hermes 4 AI models that outperform ChatGPT without content restrictions

Nous Research, a secretive artificial intelligence startup that has emerged as a leading voice in the open-source AI movement, quietly released Hermes 4 on Monday, a family of large language models that the company claims can match the performance of leading proprietary systems while offering unprecedented user control and minimal content restrictions. The release represents a significant…
Read more
AI & RoboticsNews

Anthropic launches Claude for Chrome in limited beta, but prompt injection attacks remain a major concern

Anthropic has begun testing a Chrome browser extension that allows its Claude AI assistant to take control of users’ web browsers, marking the company’s entry into an increasingly crowded and potentially risky arena where artificial intelligence systems can directly manipulate computer interfaces. The San Francisco-based AI company announced Tuesday that it would pilot “Claude for Chrome”…
Read more
AI & RoboticsNews

LLMs generate ‘fluent nonsense’ when reasoning outside their training zone

A new study from Arizona State University researchers suggests that the celebrated “Chain-of-Thought” (CoT) reasoning in Large Language Models (LLMs) may be more of a “brittle mirage” than genuine intelligence. The research builds on a growing body of work questioning the depth of LLM reasoning, but it takes a unique “data distribution” lens to test where and why CoT breaks down…
Read more
AI & RoboticsNews

OpenCUA’s open source computer-use agents rival proprietary models from OpenAI and Anthropic

A new framework from researchers at The University of Hong Kong (HKU) and collaborating institutions provides an open source foundation for creating robust AI agents that can operate computers. The framework, called OpenCUA, includes the tools, data, and recipes for scaling the development of computer-use agents (CUAs). Models trained using this framework perform strongly on CUA benchmarks…
Read more