Even as large language models (LLMs) become ever more sophisticated and capable, they continue to suffer from hallucinations: offering up inaccurate information, or, to put it more harshly, lying. Mayo Clinic, one of the top-ranked hospitals in the U.S., has adopted a novel technique to address this challenge. To succeed, the medical facility must overcome the limitations of retrieval-augmented…
How Yelp reviewed competing LLMs for correctness, relevance and tone to develop its user-friendly AI assistant
March 10, 2025
The review app Yelp has provided helpful information to diners and other consumers for decades. It had experimented with machine learning since its early years. During the recent explosion in AI technology, it was still encountering stumbling blocks as it worked to employ…
Once upon a time, software ate the world. Now, AI is here to digest what’s left. The old model of computing, where apps ruled, marketplaces controlled access and platforms took their cut, is unraveling. What’s emerging is an AI-first world where software functions…
Thomas Wolf, cofounder of AI company Hugging Face, has issued a stark challenge to the tech industry’s most optimistic visions of artificial intelligence, arguing that today’s AI systems are fundamentally incapable of delivering the scientific revolutions their creators promise. In a provocative blog post published on his personal website this morning, Wolf directly confronts the widely…
A standard, open framework for building AI agents is coming from Cisco, LangChain and Galileo
March 7, 2025
One goal for an agentic future is for AI agents from different organizations to freely and seamlessly talk to one another. But getting to that point requires interoperability, and these agents may have been built with different LLMs, data frameworks and code. A group of…
Anthropic just launched a new platform that lets everyone in your company collaborate on AI — not just the tech team
March 7, 2025
Anthropic has launched a significant overhaul to its developer platform, introducing team collaboration features and extended reasoning capabilities for its Claude AI assistant that aim to solve major pain points for organizations implementing AI solutions.
The upgraded…
Enhancing AI agents with long-term memory: Insights into LangMem SDK, Memobase and the A-MEM Framework
March 6, 2025
AI agents can automate many tasks that enterprises want to perform. One downside, though, is that they tend to be forgetful. Without long-term memory, agents must either finish a task in a single session or be constantly re-prompted.
So, as enterprises continue to explore use cases for AI agents and how to implement them safely, the companies enabling development of agents must consider how to…
Nvidia today announced that GTC 2025, the world’s premier AI conference, will return to San Jose, California, from March 17 to March 21, with an estimated 25,000 in-person attendees and 300,000 virtual attendees. Nvidia CEO Jensen Huang’s keynote is expected to be so popular that it will…
SimilarWeb’s latest Global AI Tracker report reveals dramatic shifts in the AI landscape, painting a clear picture of market winners and losers. The comprehensive report tracks traffic patterns across various AI tool categories, providing crucial insights for industry…
The release of OpenAI GPT-4.5 has been somewhat disappointing, with many pointing out its insane price point (about 10 to 20X more expensive than Claude 3.7 Sonnet and 15 to 30X more costly than GPT-4o).
However, given that this is OpenAI’s largest and most powerful non-reasoning model, it is worth considering its strengths and the areas where it shines.
Better knowledge and alignment
There is…