ChatGPT Images adds thinking capability

OpenAI has upgraded ChatGPT Images with a new thinking mode that can search the internet, generate multiple images, and verify outputs before finalizing results. The update also improves text rendering, dense compositions, multilingual support, and style flexibility.

OpenAI has released a major update to its Artificial Intelligence image generator, ChatGPT Images. The 2.0 upgrade, introduced in a blog post on April 21, adds “thinking capabilities” for the first time. The new function lets the imaging model search the internet for real-time information using a single prompt, then create multiple images and double-check its own outputs.

OpenAI said the ability to think allows the system to do “more of the heavy lifting” between idea and image, with stronger accuracy and visual cohesion. The update also reflects more current information because of a knowledge cut-off of December last year, when OpenAI rolled out its last big Images update. ChatGPT’s handling of more sophisticated imagery is also supported by better rendering of small text and iconography, along with improved performance on dense compositions.

In thinking mode, users can create up to eight images at once, a first for ChatGPT. OpenAI positioned this as useful for more complex creative work such as social media graphics in different aspect ratios and languages, or a family of poster concepts. The model also puts more emphasis on languages beyond English and those using Latin script. It now supports Japanese, Korean, Chinese, Hindi and Bengali.

OpenAI said photo outputs now capture the “tiny flaws that add realism,” and the tool has become more capable of depicting a wider range of styles. The company highlighted cinematic stills, manga and pixel art as examples, aimed at uses including marketing and gaming. A wide array of aspect ratios is available, ranging from 3:1 to 1:3.

The upgraded Images is now available to all ChatGPT users. Coders can access it through the Codex app, while developers and businesses can use the gpt-image-2 model in the API, where pricing depends on image quality and resolution. Advanced outputs with thinking are available to Plus, Pro and Business users. OpenAI also said that in the API, outputs over 2K are in beta and may produce inconsistent results.

52

Impact Score

NVIDIA and Doosan broaden physical Artificial Intelligence partnership

NVIDIA and Doosan Group are expanding work across robotics, autonomous equipment, power infrastructure and advanced materials. The partnership links NVIDIA accelerated computing platforms with Doosan businesses serving industrial automation, energy systems and data center hardware.

Chatbot liability suits test Artificial Intelligence safety law

A Florida lawsuit targeting ChatGPT’s maker signals a new product liability threat for Artificial Intelligence companies. The fight could turn on unsettled questions about platform immunity, speech protections, causation, and federal safety rules.

Canada pushes Artificial Intelligence sovereignty strategy

Canada has unveiled an Artificial Intelligence for All strategy focused on reducing reliance on foreign cloud and Artificial Intelligence providers. The plan mirrors the EU’s new sovereignty push and sets targets for adoption, infrastructure and jobs.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.