AI & RoboticsNews

Your AI models are failing in production—Here’s how to fix model selection

Enterprises need to know if the models that power their applications and agents work in real-life scenarios. This type of evaluation can sometimes be complex because it is hard to predict specific scenarios. A revamped version of the RewardBench benchmark looks to give organizations a better idea of a model’s real-life performance. The Allen Institute of AI (Ai2) launched RewardBench 2, an…
Read more
AI & RoboticsNews

Enterprise alert: PostgreSQL just became the database you can’t ignore for AI applications

The open-source PostgreSQL (sometimes also referred to as Postgres) is apparently a very hot commodity for big enterprise data platform vendors. Snowflake is acquiring privately-held PostgreSQL provider Crunchy Data, in a deal that is reportedly valued at $250 million. The acquisition comes barely two weeks after Snowflake’s rival Databricks acquired serverless PostgreSQL vendor Neon. The pair…
Read more
AI & RoboticsNews

Model Context Protocol: A promising AI integration layer, but not a standard (yet)

In the past couple of years as AI systems have become more capable of not just generating text, but taking actions, making decisions and integrating with enterprise systems, they have come with additional complexities. Each AI model has its own proprietary way of interfacing with other software. Anthropic’s Model Context Protocol (MCP) is one of the first attempts to fill this gap. It proposes a…
Read more
AI & RoboticsNews

FLUX.1 Kontext enables in-context image generation for enterprise AI pipelines

Black Forest Labs (BFL), the startup founded by the creators of the popular Stable Diffusion model, has launched a new image generation model called FLUX.1 Kontext. This model not only generates and edits photos, but also allows users to modify them with both text and other images. The company also announced its new BFL Playground, where people can try out BFL’s models before letting them loose…
Read more