AI & Robotics | News

LLMs generate ‘fluent nonsense’ when reasoning outside their training zone

A new study from Arizona State University researchers suggests that the celebrated “Chain-of-Thought” (CoT) reasoning in Large Language Models (LLMs) may be more of a “brittle mirage” than genuine intelligence. The research builds on a growing body of work questioning the depth of LLM reasoning, but it applies a “data distribution” lens to test where and why CoT breaks down…

OpenCUA’s open source computer-use agents rival proprietary models from OpenAI and Anthropic

A new framework from researchers at The University of Hong Kong (HKU) and collaborating institutions provides an open source foundation for creating robust AI agents that can operate computers. The framework, called OpenCUA, includes the tools, data, and recipes for scaling the development of computer-use agents (CUAs). Models trained using this framework perform strongly on CUA benchmarks…

MIT report misunderstood: Shadow AI economy booms while headlines cry failure

The most widely cited statistic from a new MIT report has been deeply misunderstood. While headlines trumpet that “95% of generative AI pilots at companies are failing,” the report actually reveals something far more remarkable: the fastest and most successful enterprise technology adoption in corporate history is happening right under executives’ noses. The study, released this week by…

CodeSignal’s new AI tutoring app Cosmo wants to be the ‘Duolingo for job skills’

CodeSignal Inc., the San Francisco-based skills assessment platform trusted by Netflix, Meta, and Capital One, on Wednesday launched Cosmo, a mobile learning application that transforms spare minutes into career-ready skills through artificial intelligence-powered micro-courses. The app represents a strategic pivot for CodeSignal, which built its reputation assessing technical talent for major…