Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Multimodal gene embeddings for drug-target prediction and lineage reconstruction

View through CrossRef
ABSTRACT Understanding how gene function emerges across molecular, cellular, and pharmacologic contexts remains a central challenge in systems biology and drug discovery. Conventional computational models typically operate within a single modality, such as expression, ontology, or interaction networks, limiting their ability to capture the multidimensional nature of gene function. Here, we present NEWT (Neural Embeddings for Wide-spectrum Targeting), a multimodal deep learning framework that integrates heterogeneous biological knowledge into a unified and interpretable representation space. By combining functional annotations, large-scale co-expression data, pathway information, lineage programs, transcriptional regulons, and protein-protein interaction features through an attention-guided fusion architecture, NEWT learns cross-modal dependencies that reflect both global functional hierarchies and context-specific regulatory relationships. Applied to L1000 perturbational transcriptomes, NEWT achieves higher compound-target prediction accuracy than prior embedding models and reconstructs pharmacological networks that reveal mechanistic and repurposing opportunities. When extended to single-cell RNA-seq data, NEWT preserves developmental trajectories and enhances the resolution of lineage hierarchies. Together, these results demonstrate that multimodal gene embeddings can bridge pharmacogenomic and single-cell transcriptomic analyses within a common functional geometry, establishing a scalable foundation for integrative target discovery and systems-level modeling of cellular identity.
Title: Multimodal gene embeddings for drug-target prediction and lineage reconstruction
Description:
ABSTRACT Understanding how gene function emerges across molecular, cellular, and pharmacologic contexts remains a central challenge in systems biology and drug discovery.
Conventional computational models typically operate within a single modality, such as expression, ontology, or interaction networks, limiting their ability to capture the multidimensional nature of gene function.
Here, we present NEWT (Neural Embeddings for Wide-spectrum Targeting), a multimodal deep learning framework that integrates heterogeneous biological knowledge into a unified and interpretable representation space.
By combining functional annotations, large-scale co-expression data, pathway information, lineage programs, transcriptional regulons, and protein-protein interaction features through an attention-guided fusion architecture, NEWT learns cross-modal dependencies that reflect both global functional hierarchies and context-specific regulatory relationships.
Applied to L1000 perturbational transcriptomes, NEWT achieves higher compound-target prediction accuracy than prior embedding models and reconstructs pharmacological networks that reveal mechanistic and repurposing opportunities.
When extended to single-cell RNA-seq data, NEWT preserves developmental trajectories and enhances the resolution of lineage hierarchies.
Together, these results demonstrate that multimodal gene embeddings can bridge pharmacogenomic and single-cell transcriptomic analyses within a common functional geometry, establishing a scalable foundation for integrative target discovery and systems-level modeling of cellular identity.

Related Results

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Abstract 902: Explainable AI: Graph machine learning for response prediction and biomarker discovery
Abstract 902: Explainable AI: Graph machine learning for response prediction and biomarker discovery
Abstract Accurately predicting drug sensitivity and understanding what is driving it are major challenges in drug discovery. Graphs are a natural framework for captu...
Expression and polymorphism of genes in gallstones
Expression and polymorphism of genes in gallstones
ABSTRACT Through the method of clinical case control study, to explore the expression and genetic polymorphism of KLF14 gene (rs4731702 and rs972283) and SR-B1 gene (rs...
ANDES: a novel best-match approach for enhancing gene set analysis in embedding spaces
ANDES: a novel best-match approach for enhancing gene set analysis in embedding spaces
AbstractEmbedding methods have emerged as a valuable class of approaches for distilling essential information from complex high-dimensional data into more accessible lower-dimensio...
AFR-BERT: Attention-based mechanism feature relevance fusion multimodal sentiment analysis model
AFR-BERT: Attention-based mechanism feature relevance fusion multimodal sentiment analysis model
Multimodal sentiment analysis is an essential task in natural language processing which refers to the fact that machines can analyze and recognize emotions through logical reasonin...
When Word Embeddings Become Endangered
When Word Embeddings Become Endangered
Big languages such as English and Finnish have many natural language processing (NLP) resources and models, but this is not the case for low-resourced and endangered languages as s...

Back to Top