It started with the announcement of OpenAI’s o1 model in Sept. 2024, but really took off with the DeepSeek R1 release in Jan. 2025.
Now, it seems that most major AI model providers and trainers are in a new race to deliver better, faster, and cheaper “reasoning” AI language models — that is, ones that maybe take a little longer to respond to a human user, but ideally do so with better…
In a Nutshell
Writer, an enterprise AI company, launched AI HQ to help businesses bridge the gap between AI potential and real-world results. The platform features autonomous agents for complex workflows, self-evolving models, and a $1.9 billion valuation with unique…
New open source AI company Deep Cogito releases first models and they’re already topping the charts
April 9, 2025
Deep Cogito, a new AI research startup based in San Francisco, officially emerged from stealth today with Cogito v1, a new line of open source large language models (LLMs) fine-tuned from Meta’s Llama 3.2 and equipped with hybrid reasoning capabilities — the ability to…
As tech giants declare their AI releases open — and even put the word in their names — the once insider term “open source” has burst into the modern zeitgeist. During this precarious time in which one company’s misstep could set back the public’s comfort with AI by a decade or more, the concepts of openness and transparency are being wielded haphazardly, and sometimes dishonestly, to…
Whether by automating tasks, serving as copilots or generating text, images, video and software from plain English, AI is rapidly altering how we work. Yet, for all the talk about AI and jobs, widespread workforce displacement has yet to happen.
It seems likely that this could be…
Baidu delivers new LLMs ERNIE 4.5 and ERNIE X1 undercutting DeepSeek, OpenAI on cost — but they’re not open source (yet)
March 18, 2025
Over the weekend, Chinese web search giant Baidu announced the launch of two new AI models, ERNIE 4.5 and ERNIE X1, a multimodal language model and reasoning model, respectively.
Baidu claims they offer state-of-the-art performance on a variety of metrics, besting…
LLMs Power AI: Exploring Transformer Architecture
February 17, 2025
Today, virtually every cutting-edge AI product and model uses a transformer architecture. Large language models (LLMs) such as GPT-4o, LLaMA, Gemini and Claude are all transformer-based, and other AI applications such as text-to-speech, automatic speech recognition, image generation and text-to-video models have transformers as their underlying technology.
With the hype around AI not likely to…
Meta founder and CEO Mark Zuckerberg, who built the company atop its hit social network Facebook, finished this week strong, posting a video of himself doing a leg press exercise on a machine at the gym on his personal Instagram (a social network Facebook acquired in…
Artificial intelligence company Cohere unveiled significant updates to its fine-tuning service on Thursday, aiming to accelerate enterprise adoption of large language models. The enhancements support Cohere’s latest Command R 08-2024 model and provide businesses with…
AI is rapidly evolving, poised to transform the workplace in ways that were thought of as science fiction only a few years ago. For example, Google recently updated NotebookLM, its AI-powered research assistant and note-taking tool, with Audio Overview to turn documents into audio discussions. Writing in Ars Technica, Kyle Orland describes how he used this feature to create a…