What´s next for artificial intelligence and mathematics

Artificial Intelligence is accelerating progress in mathematics, nudging the field toward breakthroughs—yet true creative insight remains a human domain.

The landscape of mathematics is experiencing a rapid transformation as artificial intelligence becomes increasingly capable at tackling complex problems. The US Defense Advanced Research Projects Agency (DARPA) has initiated the expMath program to revolutionize mathematical research and turbocharge the traditionally slow pace of mathematical breakthroughs. Their vision centers on the creation of an artificial intelligence ´coauthor,´ a tool that can decompose monumental math problems into manageable, solvable parts. The hope is that artificial intelligence will not just assist with routine calculations but help unlock discoveries previously deemed unreachable.

Recent years have seen large reasoning models (LRMs) such as OpenAI’s o3 and Anthropic’s Claude 4 Thinking set new benchmarks by solving high-level math problems, including those found on the American Invitational Mathematics Examination (AIME) and the International Math Olympiad. Hybrid models like AlphaProof—developed by Google DeepMind—combine language models with advanced game-playing systems, achieving feats previously reserved for top human competitors. AlphaEvolve, another DeepMind creation, has even outperformed humans on over 50 unsolved math and computer science problems. However, these successes largely draw from the repetitive nature and recognizable tricks in competition problems, which differ vastly from the exploratory and open-ended challenges encountered in mathematical research.

This distinction has prompted the development of new benchmarks like Epoch AI’s FrontierMath, designed in collaboration with mathematicians to push artificial intelligence further by introducing entirely novel problems that demand hours of expert-level reasoning. While leading language models achieve close to perfect scores on standardized tests, they still struggle to surpass 20% on these new, domain-driven challenges, exposing current technological limits. Researchers like Sergei Gukov at Caltech have begun innovating with approaches that condense sequences of mathematical reasoning into ‘supermoves,’ allowing reinforcement-learning systems to check entire attack directions on long-standing conjectures such as the Andrews-Curtis problem, thereby saving years of human effort.

Yet, a key question persists: can artificial intelligence deliver genuine mathematical insight, or does it remain a sophisticated assistant? Advanced tools like AlphaEvolve and Meta’s PatternBoost support human exploration by rapidly generating and evaluating ideas—functioning as a creative brainstorming partner. Mathematicians such as Geordie Williamson envision a future where artificial intelligence helps unearth mathematical objects that have the potential to shape the discipline, but emphasize that intuition and conceptual breakthroughs, like inventing the icosahedron, remain uniquely human traits. Ultimately, artificial intelligence is poised as an invaluable scout and collaborator, accelerating progress in mathematics—but the core of true discovery still lies with human curiosity and ingenuity.

80

Impact Score

Artificial Intelligence tumour testing aims to personalize cancer treatment

A UK-funded cancer testing platform is using living tumour replicas and Artificial Intelligence analysis to identify which drugs are most likely to work before treatment starts. Researchers say the approach could reduce ineffective chemotherapy and improve decisions for patients with aggressive cancers.

Figure advances home robotics with living room cleanup

Figure says its Helix 02 humanoid can now autonomously tidy a living room, marking a step beyond kitchen-focused tasks. The robotics roundup also highlights a DJI vacuum security flaw, new object-finding research, and notable industry moves.

Microsoft launches Copilot Health in the US

Microsoft has introduced Copilot Health as a protected space inside Copilot that combines medical records, wearable data and lab results into personalised health insights. The service is launching first for adults in the US with strong privacy controls and a limited initial rollout.

Tesla plans terafab for Artificial Intelligence chips

Tesla is moving toward a large-scale chip manufacturing project to support its autonomous driving roadmap. Elon Musk said the terafab effort for Artificial Intelligence chips will launch in seven days and may involve Intel, TSMC and Samsung.

Timeline traces evolution, civilisation and planetary stewardship

A sweeping chronology links cosmology, evolution, human history and modern environmental risk in a single long view of the human condition. The sequence culminates in contemporary debates over climate change, biodiversity loss and artificial intelligence governance.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.