Google DeepMind Unveils AlphaEvolve, an LLM Agent That Solves Real-World Problems

Google DeepMind’s AlphaEvolve uses large language models to crack both unsolved scientific puzzles and optimize crucial real-world systems, advancing Artificial Intelligence’s impact on mathematics and computing.

Google DeepMind has introduced AlphaEvolve, a groundbreaking agent leveraging the Gemini 2.0 family of large language models to solve long-standing challenges in mathematics, computer science, and real-world optimization. Unlike prior approaches that relied on language models solely for code suggestion, AlphaEvolve employs an iterative process: it generates numerous code candidates, evaluates their outcomes, discards less effective attempts, and refines promising solutions until it identifies algorithms superior to existing ones. This technique has already led to significant advancements, such as improving Google’s global server job allocation software, ultimately freeing up 0.7% of the company’s total computing capacity—a notable gain at Google’s operational scale.

The new agent builds on Google DeepMind´s lineage of Artificial Intelligence research, following notable predecessors like AlphaTensor, which broke decades-old records in matrix multiplication, and AlphaDev, which accelerated basic computational tasks. AlphaEvolve, however, transcends earlier tools by being able to generate hundreds of lines of code and address vastly more varied and complex problems. It utilizes both the highly efficient Gemini 2.0 Flash and the more powerful Gemini 2.0 Pro models: Flash is used for rapid code generation, while Pro intervenes for particularly challenging cases. This multi-stage evolutionary approach mirrors a process of ´survival of the fittest´ for algorithms, ensuring continued improvements through each cycle until no further progress is detected.

AlphaEvolve’s successes span both theoretical and practical domains. It outperformed human-designed algorithms and previous DeepMind models in matrix multiplication, enhancing computation speeds for various matrix sizes and generalizing solutions for broader numerical contexts. Testing it on over 50 mathematical challenges, AlphaEvolve either matched or surpassed existing results in the majority of cases, with notable breakthroughs in fields like Fourier analysis and number theory. Its real-world applications extend beyond mathematics, including optimizing data center operations, cutting power consumption of tensor processing units, and even expediting the training of the Gemini language models themselves. While AlphaEvolve currently excels in problems amenable to automated evaluation, it falls short in areas requiring subjective human judgment. Despite lacking theoretical transparency in its algorithmic reasoning, the tool’s efficiency and broad applicability herald a transformative shift in how researchers approach complex scientific and computational problems, with Google DeepMind pledging continued development and expansion of its potential uses.

87

Impact Score

Gen and Intel push on device artificial intelligence deepfake detection

Cyber safety company Gen is partnering with Intel to bring on device artificial intelligence deepfake detection to consumer hardware, targeting scams that hide inside long form video and synthetic audio. New research from Gen suggests most deepfake enabled fraud now emerges during extended viewing sessions rather than through obvious phishing links.

Asic scaling challenges Nvidia’s artificial intelligence gpu dominance

Between 2022 and 2025, major vendors increased artificial intelligence chip output primarily by enlarging hardware rather than fundamentally improving individual processors. Nvidia and its rivals are presenting dual chip cards as single units to market apparent performance gains.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.