AI & RoboticsNews

OpenAI makes ChatGPT’s image generation available as API

Top Tech News - ChatGPT’s Image API: Transforming Visuals

People can now natively incorporate Studio Ghibli-inspired pictures generated by ChatGPT into their businesses. OpenAI has added the model behind its wildly popular image generation tool, used in ChatGPT, to its API.

The gpt-image-1 model will allow developers and enterprises to “integrate high-quality, professional-grade image generation directly into their own tools and platforms.”

“The model’s versatility allows it to create images across diverse styles, faithfully follow custom guidelines, leverage world knowledge, and accurately render text — unlocking countless practical applications across multiple domains,” OpenAI said in a blog post.

Pricing for the API separates tokens for text and images. Text input tokens, or the prompt text, will cost $5 per 1 million tokens. Image input tokens will be $10 per million tokens, while image output tokens, or the generated image, will be a whopping $40 per million tokens.

Competitors like Stability AI offer a credit-based system for its API where one credit is equal to $0.01. Using its flagship Stable Image Ultra costs eight credits per generation. Google’s image generation model, Imagen, charges paying users $0.03 per image generated using the Gemini API.

Image generation in one place

OpenAI allowed ChatGPT users to generate and edit images directly on the chat interface in April, a few months after adding image generation into ChatGPT through the GPT-4o model.

The company said image generation in the chat platform “quickly became one of our most popular features.” OpenAI said over 130 million users have accessed the feature and created 700 million photos in the first week alone.

Also Read: ChatGPT Memory Now Remembers All Past Chats Automatically

However, this popularity also presented OpenAI with some challenges. Social media users quickly discovered that they could prompt ChatGPT to generate images inspired by the Japanese animation juggernaut Studio Ghibli, and as a result, my social media feeds were filled with the same photos for the entire weekend. The trend prompted OpenAI CEO Sam Altman to claim the company’s GPUs “are melting.”

OpenAI previously added its image model DALL-E 3 on ChatGPT. That model was a diffusion transformer model rather than the native multimodal understanding that GPT-4o has.

Enterprise use cases

Enterprises want the ability to generate images for their projects, and many don’t want to open a separate application to do so. By adding the image model to its API, OpenAI allows enterprises to connect gpt-image-1 to their own ecosystems.

OpenAI said it’s already seen several enterprises and startups use the model for creative projects, products and experiences, naming several well-known brands in its blog post.

Canva is reportedly exploring ways to integrate gpt-image-1 for its Canva AI and Magic Studio Tools. GoDaddy has already begun experimenting with image generation for customers to create their logos, and Airtable now enables enterprise marketing and creative teams to easily manage asset workflows at scale.

OpenAI said gpt-image-1 will get the same safety guardrails on the API as in ChatGPT. The company said images generated with the model natively include metadata from the Coalition for Content Provenance and Authenticity (C2PA) that labels content as AI-generated and tracks ownership. OpenAI is part of C2PA’s steering committee.

Users can also control content moderation to generate images that best align with their brand.

OpenAI promised that it will not use customer API data, including any images uploaded or generated by gpt-image-1 to train its models.


Author: Emilia David
Source: Venturebeat
Reviewed By: Editorial Team

Related posts
CryptoNews

Cyber Gang Spent $4M on Nightclubs, $9M on Exotic Cars in $263M Crypto Fraud, DOJ Says

CryptoNews

Chinese Public Company DDC Enterprise to Acquire 5,000 BTC for Balance Sheet

CryptoNews

Solana’s dApp Revenue Opportunity: Resilience in a Volatile Market

CryptoNews

Former Rep. McHenry: Gensler's Anti-Crypto Stance Tied to Senate Politics – Crypto News Bitcoin News

Sign up for our Newsletter and
stay informed!

Worth reading...
Google adds more AI tools to its Workspace productivity apps