Javascript must be enabled to continue!
Adaptive Representation of Molecules and Materials in Bayesian Optimization
View through CrossRef
Bayesian optimization (BO) is increasingly used in molecular optimization and in guiding self-driving laboratories for automated materials discovery. A crucial aspect of BO is how molecules and materials are represented as feature vectors, where both the completeness and compactness of these representations can influence the efficiency of the optimization process. Traditionally, a fixed representation is chosen by expert chemists or applying data-driven feature selection methods on available labelled datasets. However, when dealing with novel optimization tasks, prior knowledge or large datasets are often unavailable, and relying on these even can introduce bias into the search process. In this work, we demonstrate a Feature Adaptive Bayesian Optimization (FABO) framework, which integrates feature selection in the Bayesian optimization process to dynamically adapt material representations throughout the optimization cycles. We demonstrate the effectiveness of this adaptive approach across several molecular optimization tasks, including the discovery of high-performing metal-organic frameworks (MOFs) in three distinct tasks, each involving unique property distributions and requiring a distinct representation. Our results show that the adaptive nature of the representation leads to outperforming random search baseline and scenarios where prior knowledge of the feature space is available. Notably, for known optimization tasks, FABO automatically identifies representations that are aligned with human chemical intuition, validating its utility for optimization tasks where such insights are not available in advance. Lastly, we show how a biased representation can adversely impact BO performance, highlighting the importance of adaptive representation to different tasks. Our findings highlight FABO as a robust approach for navigating large, complex materials search spaces in automated discovery campaigns.
American Chemical Society (ACS)
Title: Adaptive Representation of Molecules and Materials in Bayesian Optimization
Description:
Bayesian optimization (BO) is increasingly used in molecular optimization and in guiding self-driving laboratories for automated materials discovery.
A crucial aspect of BO is how molecules and materials are represented as feature vectors, where both the completeness and compactness of these representations can influence the efficiency of the optimization process.
Traditionally, a fixed representation is chosen by expert chemists or applying data-driven feature selection methods on available labelled datasets.
However, when dealing with novel optimization tasks, prior knowledge or large datasets are often unavailable, and relying on these even can introduce bias into the search process.
In this work, we demonstrate a Feature Adaptive Bayesian Optimization (FABO) framework, which integrates feature selection in the Bayesian optimization process to dynamically adapt material representations throughout the optimization cycles.
We demonstrate the effectiveness of this adaptive approach across several molecular optimization tasks, including the discovery of high-performing metal-organic frameworks (MOFs) in three distinct tasks, each involving unique property distributions and requiring a distinct representation.
Our results show that the adaptive nature of the representation leads to outperforming random search baseline and scenarios where prior knowledge of the feature space is available.
Notably, for known optimization tasks, FABO automatically identifies representations that are aligned with human chemical intuition, validating its utility for optimization tasks where such insights are not available in advance.
Lastly, we show how a biased representation can adversely impact BO performance, highlighting the importance of adaptive representation to different tasks.
Our findings highlight FABO as a robust approach for navigating large, complex materials search spaces in automated discovery campaigns.
Related Results
Sample-efficient Optimization Using Neural Networks
Sample-efficient Optimization Using Neural Networks
<p>The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligibl...
Figs S1-S9
Figs S1-S9
Fig. S1. Consensus phylogram (50 % majority rule) resulting from a Bayesian analysis of the ITS sequence alignment of sequences generated in this study and reference sequences from...
SAFE-OPT: A Bayesian optimization algorithm for learning optimal deep brain stimulation parameters with safety constraints
SAFE-OPT: A Bayesian optimization algorithm for learning optimal deep brain stimulation parameters with safety constraints
AbstractTo treat neurological and psychiatric diseases with deep brain stimulation, a trained clinician must select parameters for each patient by monitoring their symptoms and sid...
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4–9 orders of magnitude. Relying solely ...
Adaptive Bayesian Optimization for State-Dependent Brain Stimulation
Adaptive Bayesian Optimization for State-Dependent Brain Stimulation
AbstractBrain stimulation has become an important treatment option for a variety of neurological and psychiatric diseases. A key challenge in improving brain stimulation is selecti...
Full Bayesian models for paired RNA-seq data and Bayesian equivalence test
Full Bayesian models for paired RNA-seq data and Bayesian equivalence test
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] "In my doctorate research, I have developed Bayesian models to analyze the paired RNAseq data for different t...
The Oxford Handbook of Bayesian Econometrics
The Oxford Handbook of Bayesian Econometrics
Bayesian econometric methods have enjoyed an increase in popularity in recent years. Econometricians, empirical economists, and policymakers are increasingly making use of Bayesian...
From p-values to Bayes Factor: A Meta-Analytic Comparison in Colorectal Research
From p-values to Bayes Factor: A Meta-Analytic Comparison in Colorectal Research
Abstract
The prevalent method for synthesizing evidence from multiple studies is the frequentist meta-analysis, which relies on assumptions of long-term frequencies and d...

