ChatGPT Images adds thinking capability

OpenAI has upgraded ChatGPT Images with a new thinking mode that can search the internet, generate multiple images, and verify outputs before finalizing results. The update also improves text rendering, dense compositions, multilingual support, and style flexibility.

OpenAI has released a major update to its artificial intelligence image generator, ChatGPT Images. The 2.0 upgrade, introduced in a blog post on April 21, adds “thinking capabilities” for the first time. From a single prompt, the imaging model can now search the internet for real-time information, create multiple images and double-check its own outputs.

OpenAI said the ability to think allows the system to do “more of the heavy lifting” between idea and image, with stronger accuracy and visual cohesion. The model also draws on more current information: its knowledge cut-off is December of last year, which is also when OpenAI rolled out its last big Images update. Better rendering of small text and iconography, along with improved performance on dense compositions, helps the model handle more sophisticated imagery.

In thinking mode, users can create up to eight images at once, a first for ChatGPT. OpenAI positioned this as useful for more complex creative work, such as a family of poster concepts or social media graphics in different aspect ratios and languages. The model also puts more emphasis on languages beyond English and other Latin-script languages. It now supports Japanese, Korean, Chinese, Hindi and Bengali.

OpenAI said photo outputs now capture the “tiny flaws that add realism,” and the tool has become more capable of depicting a wider range of styles. The company highlighted cinematic stills, manga and pixel art as examples, aimed at uses including marketing and gaming. A wide array of aspect ratios is available, ranging from 3:1 to 1:3.
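For readers preparing assets, the stated 3:1 to 1:3 range can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, it assumes the range is inclusive at both ends, and it does not reflect any official OpenAI tooling or the exact sizes the product accepts.

```python
from fractions import Fraction

# Assumed bounds, taken from the "3:1 to 1:3" range quoted in the article.
# Treating both ends as inclusive is an assumption.
MAX_RATIO = Fraction(3, 1)
MIN_RATIO = Fraction(1, 3)


def is_supported_ratio(width: int, height: int) -> bool:
    """Return True if width:height lies within the assumed 1:3 to 3:1 range."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return MIN_RATIO <= Fraction(width, height) <= MAX_RATIO


def height_for(width: int, ratio_w: int, ratio_h: int) -> int:
    """Pixel height for a given width at ratio_w:ratio_h, rounded to the nearest pixel."""
    if width <= 0 or ratio_w <= 0 or ratio_h <= 0:
        raise ValueError("all arguments must be positive")
    return round(width * ratio_h / ratio_w)
```

Under these assumptions, a 16:9 banner passes the check while a 4:1 strip does not, and a 1536-pixel-wide image at 3:1 works out to 512 pixels tall.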

The upgraded Images is now available to all ChatGPT users. Coders can access it through the Codex app, while developers and businesses can use the gpt-image-2 model in the API, where pricing depends on image quality and resolution. Advanced outputs with thinking are available to Plus, Pro and Business users. OpenAI also said that in the API, outputs over 2K are in beta and may produce inconsistent results.
