Multimodal gene embeddings for drug-target prediction and lineage reconstruction
ABSTRACT
Understanding how gene function emerges across molecular, cellular, and pharmacologic contexts remains a central challenge in systems biology and drug discovery. Conventional computational models typically operate within a single modality, such as expression, ontology, or interaction networks, limiting their ability to capture the multidimensional nature of gene function. Here, we present NEWT (Neural Embeddings for Wide-spectrum Targeting), a multimodal deep learning framework that integrates heterogeneous biological knowledge into a unified and interpretable representation space. By combining functional annotations, large-scale co-expression data, pathway information, lineage programs, transcriptional regulons, and protein-protein interaction features through an attention-guided fusion architecture, NEWT learns cross-modal dependencies that reflect both global functional hierarchies and context-specific regulatory relationships. Applied to L1000 perturbational transcriptomes, NEWT achieves higher compound-target prediction accuracy than prior embedding models and reconstructs pharmacological networks that reveal mechanistic and repurposing opportunities. When extended to single-cell RNA-seq data, NEWT preserves developmental trajectories and enhances the resolution of lineage hierarchies. Together, these results demonstrate that multimodal gene embeddings can bridge pharmacogenomic and single-cell transcriptomic analyses within a common functional geometry, establishing a scalable foundation for integrative target discovery and systems-level modeling of cellular identity.
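The abstract names an attention-guided fusion of several per-gene feature sources but gives no architectural details. As a minimal sketch only, and not NEWT's actual implementation, the following shows one common way such a fusion can work: each modality contributes a vector for the same gene, a query vector scores each modality by scaled dot product, and a softmax over those scores weights the sum. The function names, the modality labels, the fixed query vector, and the assumption that all modalities already share one dimension are hypothetical choices made for illustration.

```python
import math
from typing import Dict, List, Tuple


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_modalities(
    modality_vecs: Dict[str, List[float]],
    query: List[float],
) -> Tuple[List[float], Dict[str, float]]:
    """Fuse per-modality vectors for one gene with attention weights.

    Hypothetical illustration: `query` stands in for a learned parameter,
    and every modality vector is assumed to be projected to len(query).
    """
    names = sorted(modality_vecs)  # fixed order for reproducibility
    dim = len(query)
    scale = math.sqrt(dim)

    scores = []
    for name in names:
        vec = modality_vecs[name]
        if len(vec) != dim:
            raise ValueError(f"modality {name!r} has wrong dimension")
        # Scaled dot-product score between the query and this modality.
        scores.append(sum(q * v for q, v in zip(query, vec)) / scale)

    weights = softmax(scores)
    # Weighted sum of modality vectors, computed one dimension at a time.
    fused = [
        sum(w * modality_vecs[name][i] for w, name in zip(weights, names))
        for i in range(dim)
    ]
    return fused, dict(zip(names, weights))


# Toy vectors for a single gene; the values are made up for illustration.
gene_vecs = {
    "coexpression": [1.0, 0.0],
    "ppi": [0.0, 1.0],
}
fused, weights = fuse_modalities(gene_vecs, query=[1.0, 0.0])
print(weights)
print(fused)
```

The returned weights are one simple route to the interpretability the abstract mentions: they show how much each modality contributed to the fused vector for a given gene. In a trained model the query and the per-modality projections would be learned rather than fixed.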

