A new study from Arizona State University researchers suggests that the celebrated “Chain-of-Thought” (CoT) reasoning in Large Language Models (LLMs) may be more of a “brittle mirage” than genuine intelligence. The research builds on a growing body of work questioning the depth of LLM reasoning, but it takes a unique “data distribution” lens to test where and why CoT breaks down…
When OpenAI launched GPT-5 about two weeks ago, CEO Sam Altman promised it would be the company’s “smartest, fastest, most useful model yet.” Instead, the launch triggered one of the most contentious user revolts in the brief history of consumer AI.
Now, a simple blind…
Meta is partnering with Midjourney and will license its technology for ‘future models and products’
August 25, 2025
Even three years after its debut, with ever increasing competition in the AI image and video generation space, Midjourney, the bootstrapped San Francisco startup, remains the “gold standard” for its 20 million users — including us here at VentureBeat, where we use it…
OpenCUA’s open source computer-use agents rival proprietary models from OpenAI and Anthropic
August 25, 2025
A new framework from researchers at The University of Hong Kong (HKU) and collaborating institutions provides an open source foundation for creating robust AI agents that can operate computers. The framework, called OpenCUA, includes the tools, data, and recipes for scaling the development of computer-use agents (CUAs).
Models trained using this framework perform strongly on CUA benchmarks…
Busted by the em dash — AI’s favorite punctuation mark, and how it’s blowing your cover
August 25, 2025
Let’s talk about the em dash. Not the little innocent hyphen, not its slightly more confident cousin, the en dash. No, I’m talking about the ‘EM dash,’ that long, dramatic line that AI looooooves to drop in your sentences like it’s getting paid per dash. Seriously…
The Chan Zuckerberg Initiative announced Thursday the launch of rBio, the first artificial intelligence model trained to reason about cellular biology using virtual simulations rather than requiring expensive laboratory experiments — a breakthrough that could dramatically…
The most widely cited statistic from a new MIT report has been deeply misunderstood. While headlines trumpet that “95% of generative AI pilots at companies are failing,” the report actually reveals something far more remarkable: the fastest and most successful enterprise technology adoption in corporate history is happening right under executives’ noses.
The study, released this week by…
Inside Walmart’s AI security stack: How a startup mentality is hardening enterprise-scale defense
August 22, 2025
VentureBeat recently sat down (virtually) with Jerry R. Geisler III, Executive Vice President and Chief Information Security Officer at Walmart Inc., to gain insights into the cybersecurity challenges the world’s largest retailer faces as AI becomes increasingly…
VB AI Impact Series: Can you really govern multi-agent AI?
August 21, 2025
In a Nutshell
VentureBeat’s AI Impact Series discussed deploying multi-agent AI systems with SAP and Agilent. They focus on scaling AI agents safely, integrating AI across the organization, governance frameworks, agent integration challenges, data layers importance…
CodeSignal Inc., the San Francisco-based skills assessment platform trusted by Netflix, Meta, and Capital One, launched Cosmo on Wednesday, a mobile learning application that transforms spare minutes into career-ready skills through artificial intelligence-powered micro-courses.
The app represents a strategic pivot for CodeSignal, which built its reputation assessing technical talent for major…