Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Improving QSAR model predictions using ensembled heterogenous features

View through CrossRef
Quantitative structure–activity relationship (QSAR) models are widely used computational tools in drug discovery for predicting molecular activities and prioritizing compounds for experimental testing. However, many existing QSAR methods still suffer from limited applicability domains, overfitting, and instability across different datasets. To address these challenges, we present GT-QSAR, an automated machine learning workflow for structure-centric QSAR modeling that integrates heterogeneous molecular representations through ensemble learning. GT-QSAR supports both receptor- and ligand-based modeling strategies and combines diverse feature groups, including molecular descriptors, pharmacophore fingerprints, graph-based embeddings, grid-based shape and electrostatic representations, and protein–ligand interaction features. Systematic benchmarking using the Papyrus dataset demonstrates that GT-QSAR provides improved predictive accuracy, model stability, and generalizability compared with control tools. In addition, virtual screening experiments using the LIT-PCBA benchmark show that GT-QSAR substantially improves enrichment compared with conventional similarity-based and docking-based screening methods and previously reported target-specific machine learning models. These results demonstrate that GT-QSAR provides a robust and generalizable framework for QSAR modeling, enabling improved prioritization of active compounds in realistic drug discovery campaigns. The data used for QSAR modeling as part of this work is made publicly available. GT-QSAR is available online at https://gtqsar.standigm.com.
Title: Improving QSAR model predictions using ensembled heterogenous features
Description:
Quantitative structure–activity relationship (QSAR) models are widely used computational tools in drug discovery for predicting molecular activities and prioritizing compounds for experimental testing.
However, many existing QSAR methods still suffer from limited applicability domains, overfitting, and instability across different datasets.
To address these challenges, we present GT-QSAR, an automated machine learning workflow for structure-centric QSAR modeling that integrates heterogeneous molecular representations through ensemble learning.
GT-QSAR supports both receptor- and ligand-based modeling strategies and combines diverse feature groups, including molecular descriptors, pharmacophore fingerprints, graph-based embeddings, grid-based shape and electrostatic representations, and protein–ligand interaction features.
Systematic benchmarking using the Papyrus dataset demonstrates that GT-QSAR provides improved predictive accuracy, model stability, and generalizability compared with control tools.
In addition, virtual screening experiments using the LIT-PCBA benchmark show that GT-QSAR substantially improves enrichment compared with conventional similarity-based and docking-based screening methods and previously reported target-specific machine learning models.
These results demonstrate that GT-QSAR provides a robust and generalizable framework for QSAR modeling, enabling improved prioritization of active compounds in realistic drug discovery campaigns.
The data used for QSAR modeling as part of this work is made publicly available.
GT-QSAR is available online at https://gtqsar.
standigm.
com.

Related Results

Improving QSAR model predictions using ensembled heterogenous features
Improving QSAR model predictions using ensembled heterogenous features
Quantitative structure–activity relationship (QSAR) models are widely used computational tools in drug discovery for predicting molecular activities and prioritizing compounds for ...
Quantitative Structure-Activity Relationship (QSAR) in Drug Discovery and Development
Quantitative Structure-Activity Relationship (QSAR) in Drug Discovery and Development
Quantitative structure-activity relationship (QSAR) analysis represents a cornerstone approach in modern drug discovery and development. QSAR methodologies establish mathematical c...
4D-QSAR Analyses for EGFR Inhibitors Based on CDDA-OPS-GA Method
4D-QSAR Analyses for EGFR Inhibitors Based on CDDA-OPS-GA Method
Epidermal growth factor receptor (EGFR) is a preferred target for treating cancer. Compared to 3D-QSAR, 4D-QSAR has the feature of conformational flexibility and free alignment for...
QSAR and docking study: A review
QSAR and docking study: A review
Quantitative structure–activity relationship models (QSAR models) are regression or classification models used in the chemical and biological sciences and engineering. Like other r...
APRENDIZADO DE MÁQUINA APLICADO A QSAR
APRENDIZADO DE MÁQUINA APLICADO A QSAR
MACHINE LEARNING APLLIED TO QSAR. Over the years the study of the quantitative structure-activity relationship (QSAR) has transformed from a simple regression analysis to the imple...
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDY OF ESTER-BASED FERULIC ACID DERIVATIVES AGAINST CERVICAL CANCER CELL (HELA)
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDY OF ESTER-BASED FERULIC ACID DERIVATIVES AGAINST CERVICAL CANCER CELL (HELA)
Quantitative structure-activity relationship (QSAR) has been studied for ferulic acid derivatives to determine the QSAR model able to predict anticancer. As the subject of this res...
Designing Derivative Compounds of 4-Chlorophenyloxy N-Alkyl Phosphoramidates as Anti-Cervical Cancer Agents Based on QSAR Model
Designing Derivative Compounds of 4-Chlorophenyloxy N-Alkyl Phosphoramidates as Anti-Cervical Cancer Agents Based on QSAR Model
Design of 4-chlorophenyloxy n-alkyl phosphoramidates derivative compounds as anti-cervical cancer agents using the QSAR model research was purposed to determine the best QSAR equat...

Back to Top