Ai Engineering 2 min read

Steering Chemical Synthesis via LLM Evaluation in EPFL's Synthegy

EPFL researchers have developed Synthegy, a framework that uses large language models to evaluate and guide traditional computational chemistry algorithms.

Researchers at the École Polytechnique Fédérale de Lausanne (EPFL) have published Synthegy, a framework that integrates large language models into computational chemistry pipelines. Led by Philippe Schwaller and first author Andres M. Bran, the research introduces a steerable synthesis planning architecture. The system applies LLMs to evaluate and guide established algorithmic searches, bridging raw computational throughput with the strategic constraints of human chemists.

Algorithmic Steering via Language

Synthegy positions the language model as a strategic reasoning engine over traditional synthesis software. Chemists describe their constraints using a natural language interface, instructing the system to avoid unnecessary protecting groups or prioritize specific ring formations early in the process. The framework translates these parameters into concrete guidance for retrosynthesis engines like AiZynthfinder and Monte Carlo Tree Search (MCTS) algorithms.

By scoring potential pathways at each branch of the search tree, the system forces the underlying software to explore only strategically viable routes. This approach requires robust methods to evaluate AI output, ensuring the model correctly penalizes chemically valid but practically inefficient steps.

Expert Validation Benchmarks

The researchers validated the framework through a double-blind expert study. The panel consisted of 36 professional chemists who provided 368 valid evaluations of the proposed chemical pathways. The human experts aligned with Synthegy’s algorithmic assessments 71.2% of the time.

The system successfully detected redundant protecting steps, judged reaction feasibility, and selected optimal synthetic routes. The publication in Nature Machine Intelligence highlights a strict correlation between model size and domain performance. Larger models provide the required parameters to handle complex chemical reasoning, whereas smaller models fail to interpret nuanced strategic constraints.

Dual-Purpose Architecture

Synthegy delivers a dual-purpose capability for computational chemistry pipelines. It manages retrosynthesis by mapping backward from a final target molecule to accessible starting materials. It also charts forward reaction mechanisms by breaking down complex reactions into elementary electron movements.

The framework builds directly on EPFL’s 2024 ChemCrow project, which deployed specialized AI agents for autonomous synthesis. Synthegy shifts the paradigm from autonomous structural generation to guided algorithmic search, eliminating the high error rates that plague end-to-end generative chemistry models.

If you build computational tools for specialized scientific domains, Synthegy demonstrates the value of using LLMs as algorithmic guides rather than direct generators. Applying natural language evaluation layers to existing deterministic search trees yields higher accuracy and immediate utility in production environments.

Get Insanely Good at AI

Get Insanely Good at AI

The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.

Keep Reading