Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Improving QSAR model predictions using ensembled heterogenous features

View through CrossRef
Quantitative structure–activity relationship (QSAR) models are widely used computational tools in drug discovery for predicting molecular activities and prioritizing compounds for experimental testing. However, many existing QSAR methods still suffer from limited applicability domains, overfitting, and instability across different datasets. To address these challenges, we present GT-QSAR, an automated machine learning workflow for structure-centric QSAR modeling that integrates heterogeneous molecular representations through ensemble learning. GT-QSAR supports both receptor- and ligand-based modeling strategies and combines diverse feature groups, including molecular descriptors, pharmacophore fingerprints, graph-based embeddings, grid-based shape and electrostatic representations, and protein–ligand interaction features. Systematic benchmarking using the Papyrus dataset demonstrates that GT-QSAR provides improved predictive accuracy, model stability, and generalizability compared with control tools. In addition, virtual screening experiments using the LIT-PCBA benchmark show that GT-QSAR substantially improves enrichment compared with conventional similarity-based and docking-based screening methods and previously reported target-specific machine learning models. These results demonstrate that GT-QSAR provides a robust and generalizable framework for QSAR modeling, enabling improved prioritization of active compounds in realistic drug discovery campaigns. The data used for QSAR modeling as part of this work is made publicly available. GT-QSAR is available online at https://gtqsar.standigm.com.
Title: Improving QSAR model predictions using ensembled heterogenous features
Description:
Quantitative structure–activity relationship (QSAR) models are widely used computational tools in drug discovery for predicting molecular activities and prioritizing compounds for experimental testing.
However, many existing QSAR methods still suffer from limited applicability domains, overfitting, and instability across different datasets.
To address these challenges, we present GT-QSAR, an automated machine learning workflow for structure-centric QSAR modeling that integrates heterogeneous molecular representations through ensemble learning.
GT-QSAR supports both receptor- and ligand-based modeling strategies and combines diverse feature groups, including molecular descriptors, pharmacophore fingerprints, graph-based embeddings, grid-based shape and electrostatic representations, and protein–ligand interaction features.
Systematic benchmarking using the Papyrus dataset demonstrates that GT-QSAR provides improved predictive accuracy, model stability, and generalizability compared with control tools.
In addition, virtual screening experiments using the LIT-PCBA benchmark show that GT-QSAR substantially improves enrichment compared with conventional similarity-based and docking-based screening methods and previously reported target-specific machine learning models.
These results demonstrate that GT-QSAR provides a robust and generalizable framework for QSAR modeling, enabling improved prioritization of active compounds in realistic drug discovery campaigns.
The data used for QSAR modeling as part of this work is made publicly available.
GT-QSAR is available online at https://gtqsar.
standigm.
com.

Related Results

Improving QSAR model predictions using ensembled heterogenous features
Improving QSAR model predictions using ensembled heterogenous features
Quantitative structure–activity relationship (QSAR) models are widely used computational tools in drug discovery for predicting molecular activities and prioritizing compounds for ...
Quantitative Structure-Activity Relationship (QSAR) in Drug Discovery and Development
Quantitative Structure-Activity Relationship (QSAR) in Drug Discovery and Development
Quantitative structure-activity relationship (QSAR) analysis represents a cornerstone approach in modern drug discovery and development. QSAR methodologies establish mathematical c...
4D-QSAR Analyses for EGFR Inhibitors Based on CDDA-OPS-GA Method
4D-QSAR Analyses for EGFR Inhibitors Based on CDDA-OPS-GA Method
Epidermal growth factor receptor (EGFR) is a preferred target for treating cancer. Compared to 3D-QSAR, 4D-QSAR has the feature of conformational flexibility and free alignment for...
QSAR and docking study: A review
QSAR and docking study: A review
Quantitative structure–activity relationship models (QSAR models) are regression or classification models used in the chemical and biological sciences and engineering. Like other r...
QSAR Modeling Based on Conformation Ensembles Using a Multi-Instance Learning Approach
QSAR Modeling Based on Conformation Ensembles Using a Multi-Instance Learning Approach
Modern QSAR approaches have wide practical applications in drug discovery for screening potentially bioactive molecules before their experimental testing. Most models predicting ...
APRENDIZADO DE MÁQUINA APLICADO A QSAR
APRENDIZADO DE MÁQUINA APLICADO A QSAR
MACHINE LEARNING APLLIED TO QSAR. Over the years the study of the quantitative structure-activity relationship (QSAR) has transformed from a simple regression analysis to the imple...
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDY OF ESTER-BASED FERULIC ACID DERIVATIVES AGAINST CERVICAL CANCER CELL (HELA)
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDY OF ESTER-BASED FERULIC ACID DERIVATIVES AGAINST CERVICAL CANCER CELL (HELA)
Quantitative structure-activity relationship (QSAR) has been studied for ferulic acid derivatives to determine the QSAR model able to predict anticancer. As the subject of this res...

Back to Top