AI Agents Accelerate Catalyst Discovery, Bridging Computational and Experimental Gaps
Computational chemist Varinia Bernales—who left Dow Chemical in Michigan for the University of Toronto in 2023—recalls the grind at Dow: “The day didn’t have enough hours.” She spent hours crafting input files, submitting jobs, and debugging failures. Experimental colleagues had a flood of ideas but lacked the bandwidth to test them all. Bernales now develops El Agente, an AI agent that, she says, “fully replaces … all the work I did at Dow Chemical.” The tool stitches together disparate software, pinpointing intricate transition‑state structures in under twenty minutes—a task that previously took days.
The wider community is exploring how AI can compress the entire catalyst life cycle. Ted Sargent of Northwestern University points out that moving a catalyst from laboratory discovery to industrial deployment can take ten to twenty‑five years. His group has already harnessed AI to uncover a copper‑indium electrocatalyst that converts carbon dioxide to propylene with the highest Faradaic efficiency reported to date. By combining automated experimentation, expert insight, and machine‑learning models, the team finished the work in a matter of days.
Central to these gains is the flood of high‑quality, standardized data. Initiatives like Open Catalyst, Nomad, and the Open Quantum Materials Database (OQMD) host millions of DFT calculations. The Open Catalyst Experiments 2024 dataset, for instance, aggregates results from over 260 million DFT runs, 572 samples, and thousands of experiments characterized by X‑ray fluorescence and diffraction. It also catalogues 441 gas‑diffusion electrodes used for CO₂ reduction and hydrogen evolution, enabling researchers to train models and then validate predictions on full‑scale industrial rigs.
Machine‑learning interatomic potentials (MLIPs) have supplanted the most costly DFT simulations for many systems. Fernanda Duarte of Oxford explains that an MLIP can deliver a calculation in 25 seconds that would otherwise take hours. Trained on a few hundred DFT data points for a specific reaction, these models maintain high accuracy. Duarte cautions that GPU scarcity and expense remain obstacles, and that remote data‑center access can still be costly.
AI’s chemistry breakthroughs also depend on mining the literature for data. Jacqueline Cole of Cambridge created ChemDataExtractor v2, a chemistry‑aware natural‑language‑processing tool that extracts substance names and properties from scientific papers. The tool has yielded a database of 15,755 photocatalysts with water‑splitting activity, mined from 47,357 papers.
Yet experimental validation and scale‑up remain the bottlenecks. Bernales notes that at Dow she “did a lot of detective work trying to understand why something didn’t work.” She argues that automation could reduce this empirical trial and error. Sargent proposes that recording every material dispensed into a large‑scale flask—along with temperature, microscopy, and other data—would let AI uncover hidden process variables and accelerate scale‑up.
The industry is also tackling data‑sharing hurdles. Jahed Abed of Google X suggests that an open database and standardized competition could elevate the entire field. He points out that the chemical industry often guards data, leaving many failures unreported. Abed calls for open‑sharing standards that protect sensitive information while enabling model training.
Funding bodies are beginning to back the infrastructure required for AI‑driven catalyst research. In the United States, the Department of Energy’s Genesis Mission and the National Science Foundation’s $1.5 billion X‑Labs program could support specialized experimental centers. In the United Kingdom, the University of Liverpool’s £100 million AI Materials Hub for Innovation is a comparable initiative.
The convergence of AI agents, expansive open datasets, and automated experimentation is already delivering tangible results. Still, the field needs more comprehensive experimental data, standardized recording practices, and affordable computational resources. If these challenges are met, AI could cut the decade‑long journey from catalyst discovery to industrial deployment, reshaping a wide array of chemical processes.