Javascript must be enabled to continue!
Support Vector Machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II
View through CrossRef
AbstractAssessing the hERG liability in the early stages of drug discovery programs is important. The recent increase of hERG-related information in public databases enabled various successful applications of machine learning techniques to predict hERG inhibition. However, most of these researches constructed the datasets from only one database, limiting the predictability and scope of the models. In this study, a hERG classification model was constructed using the largest dataset for hERG inhibition built by integrating multiple databases. The integrated dataset consisted of more than 291,000 structurally diverse compounds derived from ChEMBL, GOSTAR, PubChem, and hERGCentral. The prediction model was built by support vector machine (SVM) with descriptor selection based on Non-dominated Sorting Genetic Algorithm-II (NSGA-II) to optimize the descriptor set for maximum prediction performance with the minimal number of descriptors. The SVM classification model using 72 selected descriptors and ECFP_4 structural fingerprints recorded kappa statistics of 0.733 and accuracy of 0.984 for the test set, substantially outperforming the prediction performance of the current commercial applications for hERG prediction. Finally, the applicability domain of the prediction model was assessed based on the molecular similarity between the training set and test set compounds.
Springer Science and Business Media LLC
Title: Support Vector Machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II
Description:
AbstractAssessing the hERG liability in the early stages of drug discovery programs is important.
The recent increase of hERG-related information in public databases enabled various successful applications of machine learning techniques to predict hERG inhibition.
However, most of these researches constructed the datasets from only one database, limiting the predictability and scope of the models.
In this study, a hERG classification model was constructed using the largest dataset for hERG inhibition built by integrating multiple databases.
The integrated dataset consisted of more than 291,000 structurally diverse compounds derived from ChEMBL, GOSTAR, PubChem, and hERGCentral.
The prediction model was built by support vector machine (SVM) with descriptor selection based on Non-dominated Sorting Genetic Algorithm-II (NSGA-II) to optimize the descriptor set for maximum prediction performance with the minimal number of descriptors.
The SVM classification model using 72 selected descriptors and ECFP_4 structural fingerprints recorded kappa statistics of 0.
733 and accuracy of 0.
984 for the test set, substantially outperforming the prediction performance of the current commercial applications for hERG prediction.
Finally, the applicability domain of the prediction model was assessed based on the molecular similarity between the training set and test set compounds.
Related Results
Hysteretic hERG Channel Gating Current Recorded At Physiological Temperature
Hysteretic hERG Channel Gating Current Recorded At Physiological Temperature
Abstract
Cardiac hERG channels comprise at least two subunits, hERG 1a and hERG 1b, and drive cardiac action potential repolarization. hERG 1a subunits contain a cytoplasmi...
hERG-LTN: A New Paradigm in hERG Cardiotoxicity Assessment Using Neuro-Symbolic and Generative AI Embedding (MegaMolBART, Llama3.2, Gemini, DeepSeek) Approach
hERG-LTN: A New Paradigm in hERG Cardiotoxicity Assessment Using Neuro-Symbolic and Generative AI Embedding (MegaMolBART, Llama3.2, Gemini, DeepSeek) Approach
AbstractAssessing adverse drug reactions (ADRs) during drug development is essential for ensuring the safety of new compounds. The blockade of the Ether-a-go-go-related gene (hERG)...
CLOP-hERG: The Contrastive Learning Optimized Pre-trained Model for Representation Learning in Predicting Drug-Induced hERG Channel Blockers
CLOP-hERG: The Contrastive Learning Optimized Pre-trained Model for Representation Learning in Predicting Drug-Induced hERG Channel Blockers
During drug development, ensuring that drug molecules do not block the hERG (human Ether-à-go-go-Related Gene) channel is critical. If this channel is blocked, it can cause many ca...
Reducing hERG Toxicity Using hERG Classification Model and Fragment-growing Network
Reducing hERG Toxicity Using hERG Classification Model and Fragment-growing Network
Drug-induced cardiotoxicity has become one of the major reasons leading to drug withdrawal in past decades, which is closely related to the blockade of human Ether-a-go-go-related ...
Reducing hERG Toxicity Using Reliable hERG Classification Model and Fragment Grow Model
Reducing hERG Toxicity Using Reliable hERG Classification Model and Fragment Grow Model
Drug-induced cardiotoxicity has become one of the major reasons leading to drug withdrawal in past decades, which is closely related to the blockade of human Ether-a-go-go-related ...
Quantitative structure-activity relationship study on the MMP-13 inhibitory activity of fused pyrimidine derivatives possessing a 1,2,4-Triazol-3-yl group as a ZBG
Quantitative structure-activity relationship study on the MMP-13 inhibitory activity of fused pyrimidine derivatives possessing a 1,2,4-Triazol-3-yl group as a ZBG
QSAR study has been carried out on the MMP-13 inhibitory activity of fused pyrimidine derivatives possessing a1,2,4-triazol-3-yl group as a ZBG in 0D- to 2D-Dragon descriptors. The...
cAMP Performs a HERG-culean Task
cAMP Performs a HERG-culean Task
HERG, the pore-forming subunit of the rapidly activating delayed rectifier K
+
channel, is regulated by cAMP; however, the mechanism of control remains unkn...

