Javascript must be enabled to continue!

Multi-Modal Protein Representation Learning with CLASP

ABSTRACT Effectively integrating data modalities pertaining to proteins’ amino acid sequences, three-dimensional structures, and curated text-based descriptions of their biochemical and functional properties can lead to informative representations capturing different views of proteins. Here, we introduce CLASP, a unified tri-modal framework that combines the strengths of geometric deep learning, natural large language models (LLMs), protein language models (pLMs), and contrastive learning to learn informative protein representations based on their structure, amino acid sequence, and text-based biochemical and functional descriptions. We show that CLASP enables accurate zero-shot classification and retrieval tasks, such as matching a protein structure to its sequence or description, outperforming state-of-the-art baselines. CLASP embeddings also exhibit superior clustering by protein family, and ablation studies confirm that all three modalities contribute synergistically to performance. Our results highlight the power of integrating structural, sequential, and textual signals in a single model, establishing CLASP as a general-purpose embedding framework for protein understanding.

openRxiv

Nicolas Bolouri Joseph Szymborski Amin Emad

2025

Title: Multi-Modal Protein Representation Learning with CLASP

Description:

Here, we introduce CLASP, a unified tri-modal framework that combines the strengths of geometric deep learning, natural large language models (LLMs), protein language models (pLMs), and contrastive learning to learn informative protein representations based on their structure, amino acid sequence, and text-based biochemical and functional descriptions.

We show that CLASP enables accurate zero-shot classification and retrieval tasks, such as matching a protein structure to its sequence or description, outperforming state-of-the-art baselines.

CLASP embeddings also exhibit superior clustering by protein family, and ablation studies confirm that all three modalities contribute synergistically to performance.

Our results highlight the power of integrating structural, sequential, and textual signals in a single model, establishing CLASP as a general-purpose embedding framework for protein understanding.

Back

Modal kerja merupakan suatu kekayaan yang digunakan untuk membelanjai perusahaan sehari-hari. Modal kerja biasanya berbentuk uang kas, piutang, persediaan barang yang kesemuanya it...

EFFECT OF DESIGN CHANGING OF RING CLASP ON ITS RETENTIVE FORCE

Different designs of ring clasp were indicated in short or long span bounded saddle. However, few researches have been done to calculate their retentive absolute forces. The purpos...

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021

The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Kontribusi Modal Sosial dalam Mengefektifkan Modal Lingkungan (Kasus Komunitas Kampung Nelayan Untia Makassar)

AbstractThe Untia fishing village community was formed from the relocation of the residents of Laelae Island in 1998. The community that was built from the results of relocation ha...

Endothelial Protein C Receptor

IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...

Approaches to Different Learning Styles in Undergraduate Medical Students of Al-Tibri Medical College Karachi

Objectives: The purpose of this study was to evaluate the different styles of learning preferred by undergraduate medical students from 1st to 5th year of Al-Tibri Medical College ...

CLASP RETENTION USING VARIABLE UNDERCUT DEPTHS

Retentive force may be increased in deeper undercuts. Three clasps were examined for this hypothesis in order to analyze the retentive force change properties for each clasp design...

Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval

With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet. This trend makes ...

Email:
Password:

Email:

Multi-Modal Protein Representation Learning with CLASP

Related Results