Empirical Research Assistance automates scientific coding

May 21, 2026

Empirical Research Assistance, a system developed by researchers at Google and Harvard, automatically writes and refines scientific software for scorable research tasks. Tests showed it could outperform expert-built programs across problems including COVID-19 forecasting, neural modeling, and single-cell RNA sequencing analysis.

Researchers at Google and Harvard have developed Empirical Research Assistance, an artificial intelligence system that automatically writes and improves scientific software for specialized research tasks. The system targets what the team calls “empirical software”: custom-built programs designed to maximize performance on a scientific task that can be measured by a numerical score. Such software plays a central role in modern science, where researchers rely on code to test hypotheses, interpret data, and optimize models for problems like weather prediction, disease forecasting, and protein structure prediction.

Empirical Research Assistance is designed to automate the full cycle of scientific software design and refinement, a process that often takes months or even years for human experts. It combines the Google Gemini large language model with a search strategy that explores and refines many possible code variations. Starting from baseline code for a specific problem, the system proposes modifications such as adding components or swapping algorithms, then evaluates whether those changes improve a predefined quality score. It uses tree search, a method also used in systems like AlphaGo, to decide which approaches to pursue and which to discard.
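The propose, score, and select loop described above can be illustrated with a toy sketch. This is not the researchers' implementation: the candidate "program" is reduced to a list of numbers, the `propose` function is a random perturbation standing in for the language model's code rewrite, `evaluate` is a made-up quality score, and the UCB-style selection rule is one common way to balance promising branches against unexplored ones. All names and parameters here are hypothetical.

```python
import math
import random
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    """One candidate in the search tree (a parameter list stands in for code)."""
    params: list
    score: float
    parent: Optional["Node"] = None
    visits: int = 1
    children: list = field(default_factory=list)


def evaluate(params):
    """Hypothetical quality score, higher is better (negative squared error)."""
    target = [3.0, -1.0, 2.0]  # assumed "ideal" solution for this toy task
    return -sum((p - t) ** 2 for p, t in zip(params, target))


def propose(params, rng):
    """Stand-in for the LLM rewrite step: perturb one parameter at random."""
    child = list(params)
    i = rng.randrange(len(child))
    child[i] += rng.gauss(0.0, 0.5)
    return child


def select(nodes, total_visits, c=1.0):
    """Choose the node to expand: its score plus a bonus for rarely visited nodes."""
    def ucb(node):
        return node.score + c * math.sqrt(math.log(total_visits + 1) / node.visits)
    return max(nodes, key=ucb)


def tree_search(baseline, iterations=200, seed=0):
    """Grow a tree of candidates from the baseline; return the best node and all nodes."""
    rng = random.Random(seed)
    root = Node(params=list(baseline), score=evaluate(baseline))
    nodes = [root]
    for step in range(iterations):
        parent = select(nodes, total_visits=step + 1)
        parent.visits += 1
        child_params = propose(parent.params, rng)
        child = Node(child_params, evaluate(child_params), parent=parent)
        parent.children.append(child)
        nodes.append(child)
    return max(nodes, key=lambda n: n.score), nodes


if __name__ == "__main__":
    best, nodes = tree_search([0.0, 0.0, 0.0])
    print(f"explored {len(nodes)} candidates, best score {best.score:.4f}")
```

In a real system each node would hold a full program and each evaluation would run that program against the task's benchmark, so the expensive steps are proposing and scoring rather than the search bookkeeping shown here.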

The system can also incorporate research ideas from papers and textbooks, either supplied directly by a user or retrieved automatically, and fold those ideas into later versions of the code. That design allows it to recombine existing concepts in ways that may uncover promising solutions that researchers would be unlikely to test manually. The work was co-led by Michael Brenner, Catalyst Professor of Applied Mathematics and Physics at the Harvard John A. Paulson School of Engineering and Applied Sciences and a Google research scientist, along with Shibl Mourad from Google DeepMind. Harvard Ph.D. students Qian-Ze Zhu, Ryan Krueger, and Sarah Martinson contributed as Google student researchers while working in Brenner’s group.

In testing, the system was applied to several scientific problems. Zhu used it to predict the activity of more than 70,000 neurons in the brain of a zebrafish and compare the results against actual neural data. In one experiment, the team prompted Empirical Research Assistance to use an existing neuron-modeling library to build more physically accurate simulations of neural activity, a task that would otherwise have required substantial manual effort to learn and tune. Zhu said methods that previously might take a week to implement can now be run in parallel in a few hours.

In one test, Empirical Research Assistance generated 14 models for predicting COVID-19 hospitalizations that outperformed the best U.S. Centers for Disease Control and Prevention models used during the pandemic. In another experiment, the system discovered four new methods for integrating single-cell RNA sequencing datasets, beating top human-designed approaches. The researchers say the system could cut exploration time from months to hours or days, potentially freeing scientists to focus more on defining important questions and tackling creative and critical research challenges.
