Google´s Gemini artificial intelligence now converts photos to videos for paid users

Google expands Gemini with the ability for paid users to turn photos into high-quality, 8-second videos with sound, advancing multimedia capabilities in its artificial intelligence portfolio.

Alphabet Inc., the parent company of Google, has launched a new feature for its Gemini artificial intelligence assistant, enabling paid users to transform photos into short, animated videos. Following earlier limited testing, the feature is now fully integrated into the Gemini chat interface. Users can generate 8-second, sound-enabled videos in MP4 format, rendered at 720p resolution and a 16:9 aspect ratio, by submitting a single photo along with a brief text description.

This upgrade marks a notable advancement in Gemini´s artificial intelligence toolbox, directly responding to the growing appetite for dynamic, shareable multimedia content on social media and digital platforms. By embedding this capability within the existing chat interface, Google simplifies the creative process for subscribers, allowing seamless conversion from static photography to engaging audio-visual content. The move highlights Google´s broader ambition to enhance the functionality and user appeal of its artificial intelligence offerings, potentially drawing more users towards paid tiers and boosting creative productivity.

The new feature is strategically positioned to strengthen Google´s competitiveness in a rapidly changing tech landscape, where artificial intelligence-driven multimedia tools are increasingly shaping user experiences. With the ability to produce video content from photos, content creators, marketers, and casual users alike now have a practical tool for quick, eye-catching storytelling. The output is optimized for social media sharing, given the concise length and high visual quality.

Google has also outlined guardrails for ethical and policy compliance. The artificial intelligence backend prohibits generating videos using images of public figures and forbids content promoting violence, incitement, or group attacks. Despite these safeguards, tests have revealed ongoing challenges—such as facial features or ethnicity shifting unexpectedly in photo-based human animations, and difficulties with accurately animating complex movements. Simpler animations, like natural scenes or static objects with added motion, are more reliably rendered. In response, Google acknowledged that facial animation technology based on single images is still maturing and has committed to ongoing improvements, particularly in areas like realistic human motion and facial accuracy.

Overall, the enhancement shows Google´s persistent drive to push the boundaries of artificial intelligence media generation, underlining a commitment to ongoing innovation and staying ahead in the artificial intelligence arms race. By listening to user feedback and iterating on challenging aspects, Google aims to solidify Gemini’s reputation as a versatile and forward-looking artificial intelligence tool for the evolving demands of digital communication.

57

Impact Score

Artificial Intelligence divides employers as hiring and headcount shift

U.S. hiring beat expectations in April, but employers remain split on whether Artificial Intelligence should drive layoffs, productivity gains, or internal redeployment. At the same time, candidate use of Artificial Intelligence is outpacing employer adoption in hiring, adding new pressure to screening and entry-level recruiting.

What businesses need to know about the EU cyber resilience act

The EU cyber resilience act is turning product cybersecurity into a legal requirement for companies that sell digital products into the European Union. A key compliance milestone arrives in September 2026, well before the full regulation takes effect in 2027.

Claude Mythos and cyber insurance’s next inflection point

Claude Mythos is being treated by governments and regulators as a potential systemic cyber risk with implications for financial stability and insurance markets. Its emergence is intensifying pressure on insurers to clarify whether Artificial Intelligence-enabled cyber losses are covered, excluded, or require new stand-alone products.

OpenAI expands ChatGPT ads with self-serve manager

OpenAI is widening its ChatGPT ads pilot with a beta self-serve Ads Manager, new bidding options and broader measurement tools. The push signals a deeper move into advertising as the company expands the program into several international markets.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.