Large language models (LLMs) can perform accurate classification with zero or few examples through in-context learning (ICL), allowing the model to observe query-relevant examples at inference time and eliminating the need for additional weight updates to generalize beyond its original training data. We extend this capability to regression with uncertainty estimation using frozen LLMs (e.g., GPT-4o, Gemini), enabling Bayesian optimization (BO) in natural language without explicit model training or feature engineering. We apply this to materials discovery by representing materials as synthesis and testing procedures for use in natural language prompts. This Bayesian, design-first approach prioritizes optimization toward target material properties before detailed characterization, in contrast to conventional experimental workflows that often emphasize characterization of suboptimal materials. On benchmarks like aqueous solubility and oxidative coupling of methane (OCM), BO-ICL matches or outperforms Gaussian processes. In live experiments on the reverse water–gas shift (RWGS) reaction, BO-ICL identifies multimetallic catalysts that approach equilibrium CO yield within 6 and 10 iterations from a pool of 3,700 and 360,000 candidates, respectively. Our method redefines materials representation and accelerates discovery, with broad applications across catalysis, materials science, and AI. Code: https://github.com/ur-whitelab/BO-ICL.
Building similarity graph...
Analyzing shared references across papers
Loading...
Mayk Caldas Ramos
Shane S. Michtavy
Andrew D. White
ACS Central Science
University of Rochester
Sustainability Institute
Edison Pharmaceuticals (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Ramos et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69e1cd6f5cdc762e9d856fe4 — DOI: https://doi.org/10.1021/acscentsci.5c02418
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: