Researchers from Stanford University and Google DeepMind have unveiled Step-Wise Reinforcement Learning (SWiRL), a technique designed to enhance the ability of large language models (LLMs) to tackle complex tasks requiring multi-step reasoning and tool use.
As interest in AI agents and LLM tool use continues to grow, this technique could offer substantial benefits for enterprises looking…
When AI reasoning goes wrong: Microsoft Research shows more tokens can mean more problems
April 16, 2025
In a Nutshell
Microsoft Research finds that inference-time scaling methods for large language models don’t universally improve performance. Varying benefits, token inefficiency, and unpredictable costs challenge common assumptions. Verification mechanisms enhance model…
New open source AI company Deep Cogito releases first models and they’re already topping the charts
April 9, 2025
Deep Cogito, a new AI research startup based in San Francisco, officially emerged from stealth today with Cogito v1, a new line of open source large language models (LLMs) fine-tuned from Meta’s Llama 3.2 and equipped with hybrid reasoning capabilities — the ability to…
AI lie detector: How HallOumi’s open-source approach to hallucination could unlock enterprise AI adoption
April 4, 2025
In the race to deploy enterprise AI, one obstacle consistently blocks the path: hallucinations. These fabricated responses from AI systems have caused everything from legal sanctions for attorneys to companies being forced to honor fictitious policies.
Organizations have tried a range of approaches to the hallucination challenge, including fine-tuning with better data, retrieval augmented…
At the DataGrail Summit 2024 this week, industry leaders delivered a stark warning about the rapidly advancing risks associated with AI.
Dave Zhou, CISO of Instacart, and Jason Clinton, CISO of Anthropic, highlighted the urgent need for robust security measures to keep pace…