Researchers from UCLA and Meta AI have introduced d1, a novel framework using reinforcement learning (RL) to significantly enhance the reasoning capabilities of diffusion-based large language models (dLLMs). While most attention has focused on autoregressive models like GPT, dLLMs offer unique advantages. Giving them strong reasoning skills could unlock new efficiencies and applications for…
An AI assistant that unequivocally agrees with everything you say and supports you — even your most outlandish and obviously false, misguided or straight-up bad ideas — sounds like something out of a cautionary sci-fi short story from Philip K. Dick. But it appears to be…
Chinese e-commerce and web giant Alibaba’s Qwen team has officially launched a new series of open source AI large language multimodal models known as Qwen3 that appear to be among the state-of-the-art for open models, and approach performance of proprietary models from the…
January 2025 shook the AI landscape. The seemingly unstoppable OpenAI and the powerful American tech giants were shocked by what we can certainly call an underdog in the area of large language models (LLMs). DeepSeek, a Chinese firm not on anyone’s radar, suddenly challenged OpenAI. It is not that DeepSeek-R1 was better than the top models from American giants; it was slightly behind in terms of…
Ziff Davis and IGN sue OpenAI for copyright infringement
April 28, 2025
In one of the more common disputes of modern AI, Ziff Davis, IGN Entertainment and Everyday Health Media have sued Open AI for copyright infringement.
The lawsuit from the media companies alleged copyright infringement, violations of the Digital Millennium Copyright Act…
In my first stint as a machine learning (ML) product manager, a simple question inspired passionate debates across functions and leaders: How do we know if this product is actually working? The product in question that I managed catered to both internal and external…
OpenAI makes ChatGPT’s image generation available as API
April 24, 2025
People can now natively incorporate Studio Ghibli-inspired pictures generated by ChatGPT into their businesses. OpenAI has added the model behind its wildly popular image generation tool, used in ChatGPT, to its API.
The gpt-image-1 model will allow developers and enterprises to “integrate high-quality, professional-grade image generation directly into their own tools and platforms.”
“The…
Former DeepSeeker and collaborators release new method for training reliable AI agents: RAGEN
April 24, 2025
2025 was, by many expert accounts, supposed to be the year of AI agents — task-specific AI implementations powered by leading large language and multimodal models (LLMs) like the kinds offered by OpenAI, Anthropic, Google, and DeepSeek.
But so far, most AI agents remain…
Google continues to bring its flagship AI models to its productivity apps, expanding its Gemini features.
The company today announced several updates to its Workspace products, including the addition of Audio Overviews and new streamlined methods for tracking meetings.
Audio…
Researchers from Stanford University and Google DeepMind have unveiled Step-Wise Reinforcement Learning (SWiRL), a technique designed to enhance the ability of large language models (LLMs) to tackle complex tasks requiring multi-step reasoning and tool use.
As the interest in AI agents and LLM tool use continues to increase, this technique could offer substantial benefits for enterprises looking…