AI & RoboticsNews

MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

The adoption of interoperability standards, such as the Model Context Protocol (MCP), can provide enterprises with insights into how agents and models function outside their walled confines. However, many benchmarks fail to capture real-life interactions with MCP.  Salesforce AI Research developed a new open-source benchmark it calls MCP-Universe, which aims to track LLMs as these interact with…
Read more
AI & RoboticsNews

Meta is partnering with Midjourney and will license its technology for ‘future models and products’

Even three years after its debut, with ever increasing competition in the AI image and video generation space, Midjourney, the bootstrapped San Francisco startup, remains the “gold standard” for its 20 million users — including us here at VentureBeat, where we use it to generate the “header” art to many of our articles. Apparently, the leaders of Facebook and Instagram parent company…
Read more
AI & RoboticsNews

Chan Zuckerberg Initiative’s rBio uses virtual cells to train AI, bypassing lab work

The Chan Zuckerberg Initiative announced Thursday the launch of rBio, the first artificial intelligence model trained to reason about cellular biology using virtual simulations rather than requiring expensive laboratory experiments — a breakthrough that could dramatically accelerate biomedical research and drug discovery. The reasoning model, detailed in a research paper published on bioRxiv…
Read more
AI & RoboticsNews

VB AI Impact Series: Can you really govern multi-agent AI?

In a Nutshell VentureBeat’s AI Impact Series discussed deploying multi-agent AI systems with SAP and Agilent. They focus on scaling AI agents safely, integrating AI across the organization, governance frameworks, agent integration challenges, data layers importance, orchestration layer management, and privacy and security concerns in enterprise agentic activations. Monitoring and improvement…
Read more