Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

COSMO-NET: Fast and Accurate Machine Learning Surrogates for COSMO-based Molecular Descriptors

View through CrossRef
Accurate predictive thermodynamic models are essential tools for the computational design of new molecules. Models based on COnductor-like Screening Models (COSMO), such as COSMO-SAC and COSMO-RS, are well-suited for this purpose but require computationally expensive quantum mechanical (QM) calculations to generate key properties-the surface charge density distribution (p(σ)), the surface area (A) and the cavity volume (V)-for each molecule. Here we develop graph-based neural network surrogate models, based on a directed message passing neural network (DMPNN) and a graph convolutional network (GCN), to predict these molecular properties rapidly and accurately. We train the models on a dataset of over 16,000 compounds generated through an automated QM calculation pipeline. The DMPNN model outperforms GCN for probability of surface charge density prediction, while the GCN shows higher accuracy for surface area and cavity volume. We build on these strengths to propose a hybrid model, COSMO-NET. When applied to predict octanol-water partition coefficients, COSMO-NET achieves a mean absolute error (MAE) of 0.31 compared to 0.34 and 0.36 for DMPNN and GCN, respectively. These results demonstrate that machine learning surrogates can replace costly QM calculations while maintaining accuracy, supporting the discovery of new molecules and the evaluation of their performance.
Title: COSMO-NET: Fast and Accurate Machine Learning Surrogates for COSMO-based Molecular Descriptors
Description:
Accurate predictive thermodynamic models are essential tools for the computational design of new molecules.
Models based on COnductor-like Screening Models (COSMO), such as COSMO-SAC and COSMO-RS, are well-suited for this purpose but require computationally expensive quantum mechanical (QM) calculations to generate key properties-the surface charge density distribution (p(σ)), the surface area (A) and the cavity volume (V)-for each molecule.
Here we develop graph-based neural network surrogate models, based on a directed message passing neural network (DMPNN) and a graph convolutional network (GCN), to predict these molecular properties rapidly and accurately.
We train the models on a dataset of over 16,000 compounds generated through an automated QM calculation pipeline.
The DMPNN model outperforms GCN for probability of surface charge density prediction, while the GCN shows higher accuracy for surface area and cavity volume.
We build on these strengths to propose a hybrid model, COSMO-NET.
When applied to predict octanol-water partition coefficients, COSMO-NET achieves a mean absolute error (MAE) of 0.
31 compared to 0.
34 and 0.
36 for DMPNN and GCN, respectively.
These results demonstrate that machine learning surrogates can replace costly QM calculations while maintaining accuracy, supporting the discovery of new molecules and the evaluation of their performance.

Related Results

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
The COSMO and COSMO‐RS solvation models
The COSMO and COSMO‐RS solvation models
AbstractThe conductor‐like screening model COSMO, a variant of the dielectric continuum solvation models, has become very popular due to its algorithmic simplicity, numerical stabi...
The COSMO and COSMO‐RS solvation models
The COSMO and COSMO‐RS solvation models
The conductor‐like screening model COSMO, a variant of the dielectric continuum solvation models, has become very popular due to its algorithmic simplicity, numerical stability, an...
Chemical bond overlap descriptors from multiconfiguration wavefunctions
Chemical bond overlap descriptors from multiconfiguration wavefunctions
Chemical bonds are fundamental in chemistry, serving as the foundation for understanding molecular properties. Over time, various theories and descriptors have evolved to character...
The COSMO-Ru2By configuration of the COSMO model: skill and methodology for estimating of the forecasts of - and γ-mesoscale processes
The COSMO-Ru2By configuration of the COSMO model: skill and methodology for estimating of the forecasts of - and γ-mesoscale processes
The paper gives а brief description of the COSMO-Ru2By configuration (the grid spacing is 2.2 km) of the COSMO model, which provides numerical weather forecasts for up to 48 hours ...
212 EFFECT OF OVULATION STATUS ON PREGNANCY RATES IN RECIPIENT GILTS TRANSFERRED WITH CLONED EMBRYO TRANSFER
212 EFFECT OF OVULATION STATUS ON PREGNANCY RATES IN RECIPIENT GILTS TRANSFERRED WITH CLONED EMBRYO TRANSFER
This study was conducted to determine the optimal stage of the estrus cycle to achieve the highest pregnancy rates in surrogate recipient gilts receiving somatic cell nuclear trans...

Back to Top