Javascript must be enabled to continue!
LM-PROTAC: a language model-driven PROTAC generation pipeline with dual constraints of structure and property
View through CrossRef
Abstract
The imperfect modeling of ternary complexes has limited the application of computer-aided drug discovery tools in PROTAC research and development. In this study, a language model for PROTAC molecule design pipeline named LM-PROTAC was developed, which stands for language model-driven Proteolysis Targeting Chimera, by embedding a transformer-based generative model with dual constraints on structure and properties. This study started with the idea of segmentation and representation of molecules and protein. Firstly, a language model-driven affinity model for protein compounds to screen molecular fragments with high affinity for the target protein. Secondly, structural and physicochemical properties of these fragments were constrained during the generation process to meet specific scenario requirements. Finally, a two-round screening was performed on the preliminary generated molecules using a multidimensional property prediction model. This process identified a batch of PROTAC molecules capable of degrading disease-relevant target proteins. These molecules were subsequently validated through in vitro experiments, thus providing a complete solution for language model-driven PROTAC drug generation. Taking Wnt3a, a key tumor-related target, as a POI of degradation, the LM-PROTAC pipeline successfully generated effective PROTAC molecules. The molecular distribution experiments demonstrated the high similarity of the generated molecules to the original dataset, validating the generative model’s effectiveness in accurately defining chemical space. Molecular dynamics simulations confirmed the stable interactions between the PROTAC molecules and target proteins, while protein degradation experiments verified the efficacy of the generated PROTAC molecules in degrading target proteins. The entire LM-PROTAC pipeline is reusable and can generate degraders for other target proteins within 50 days, significantly improving the efficiency of drug discovery for undruggable targets.
Springer Science and Business Media LLC
Title: LM-PROTAC: a language model-driven PROTAC generation pipeline with dual constraints of structure and property
Description:
Abstract
The imperfect modeling of ternary complexes has limited the application of computer-aided drug discovery tools in PROTAC research and development.
In this study, a language model for PROTAC molecule design pipeline named LM-PROTAC was developed, which stands for language model-driven Proteolysis Targeting Chimera, by embedding a transformer-based generative model with dual constraints on structure and properties.
This study started with the idea of segmentation and representation of molecules and protein.
Firstly, a language model-driven affinity model for protein compounds to screen molecular fragments with high affinity for the target protein.
Secondly, structural and physicochemical properties of these fragments were constrained during the generation process to meet specific scenario requirements.
Finally, a two-round screening was performed on the preliminary generated molecules using a multidimensional property prediction model.
This process identified a batch of PROTAC molecules capable of degrading disease-relevant target proteins.
These molecules were subsequently validated through in vitro experiments, thus providing a complete solution for language model-driven PROTAC drug generation.
Taking Wnt3a, a key tumor-related target, as a POI of degradation, the LM-PROTAC pipeline successfully generated effective PROTAC molecules.
The molecular distribution experiments demonstrated the high similarity of the generated molecules to the original dataset, validating the generative model’s effectiveness in accurately defining chemical space.
Molecular dynamics simulations confirmed the stable interactions between the PROTAC molecules and target proteins, while protein degradation experiments verified the efficacy of the generated PROTAC molecules in degrading target proteins.
The entire LM-PROTAC pipeline is reusable and can generate degraders for other target proteins within 50 days, significantly improving the efficiency of drug discovery for undruggable targets.
Related Results
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Interpretable PROTAC degradation prediction with structure-informed deep ternary attention framework
Interpretable PROTAC degradation prediction with structure-informed deep ternary attention framework
Proteolysis Targeting Chimeras (PROTACs) are heterobifunctional ligands that form ternary complexes with Protein Of Interests (POIs) and E3 ligases, exploiting the ubiquitin-protea...
Abstract 1685: Overcoming acquired resistance to PROTAC degraders
Abstract 1685: Overcoming acquired resistance to PROTAC degraders
Abstract
Background: Proteolysis-targeting chimera (PROTAC) technology has been widely investigated for cancer treatment and there have been several PROTAC degrader-...
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...
Installation Analysis of Matterhorn Pipeline Replacement
Installation Analysis of Matterhorn Pipeline Replacement
Abstract
The paper describes the installation analysis for the Matterhorn field pipeline replacement, located in water depths between 800-ft to 1200-ft in the Gul...
Firm Performance, Financial Constraints, and Dual-Class Share Structure
Firm Performance, Financial Constraints, and Dual-Class Share Structure
<p><b>This thesis addresses two aspects of financial constraints focusing, firstly, on the impact of financial constraints on firm performance and, secondly, on the imp...
Firm Performance, Financial Constraints, and Dual-Class Share Structure
Firm Performance, Financial Constraints, and Dual-Class Share Structure
<p><b>This thesis addresses two aspects of financial constraints focusing, firstly, on the impact of financial constraints on firm performance and, secondly, on the imp...
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Abstract
Funding Acknowledgements
Type of funding sources: None.
INTRODUCTION Patients with heart failure (HF)...

