Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Multi-Modal Protein Representation Learning with CLASP

View through CrossRef
ABSTRACT Effectively integrating data modalities pertaining to proteins’ amino acid sequences, three-dimensional structures, and curated text-based descriptions of their biochemical and functional properties can lead to informative representations capturing different views of proteins. Here, we introduce CLASP, a unified tri-modal framework that combines the strengths of geometric deep learning, natural large language models (LLMs), protein language models (pLMs), and contrastive learning to learn informative protein representations based on their structure, amino acid sequence, and text-based biochemical and functional descriptions. We show that CLASP enables accurate zero-shot classification and retrieval tasks, such as matching a protein structure to its sequence or description, outperforming state-of-the-art baselines. CLASP embeddings also exhibit superior clustering by protein family, and ablation studies confirm that all three modalities contribute synergistically to performance. Our results highlight the power of integrating structural, sequential, and textual signals in a single model, establishing CLASP as a general-purpose embedding framework for protein understanding.
Title: Multi-Modal Protein Representation Learning with CLASP
Description:
ABSTRACT Effectively integrating data modalities pertaining to proteins’ amino acid sequences, three-dimensional structures, and curated text-based descriptions of their biochemical and functional properties can lead to informative representations capturing different views of proteins.
Here, we introduce CLASP, a unified tri-modal framework that combines the strengths of geometric deep learning, natural large language models (LLMs), protein language models (pLMs), and contrastive learning to learn informative protein representations based on their structure, amino acid sequence, and text-based biochemical and functional descriptions.
We show that CLASP enables accurate zero-shot classification and retrieval tasks, such as matching a protein structure to its sequence or description, outperforming state-of-the-art baselines.
CLASP embeddings also exhibit superior clustering by protein family, and ablation studies confirm that all three modalities contribute synergistically to performance.
Our results highlight the power of integrating structural, sequential, and textual signals in a single model, establishing CLASP as a general-purpose embedding framework for protein understanding.

Related Results

ANALISIS MODAL KERJA PADA KOPERASI SERBA USAHA DI KOTA METRO
ANALISIS MODAL KERJA PADA KOPERASI SERBA USAHA DI KOTA METRO
Modal kerja merupakan suatu kekayaan yang digunakan untuk membelanjai perusahaan sehari-hari. Modal kerja biasanya berbentuk uang kas, piutang, persediaan barang yang kesemuanya it...
EFFECT OF DESIGN CHANGING OF RING CLASP ON ITS RETENTIVE FORCE
EFFECT OF DESIGN CHANGING OF RING CLASP ON ITS RETENTIVE FORCE
Different designs of ring clasp were indicated in short or long span bounded saddle. However, few researches have been done to calculate their retentive absolute forces. The purpos...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Kontribusi Modal Sosial dalam Mengefektifkan Modal Lingkungan (Kasus Komunitas Kampung Nelayan Untia Makassar)
Kontribusi Modal Sosial dalam Mengefektifkan Modal Lingkungan (Kasus Komunitas Kampung Nelayan Untia Makassar)
AbstractThe Untia fishing village community was formed from the relocation of the residents of Laelae Island in 1998. The community that was built from the results of relocation ha...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...
Approaches to Different Learning Styles in Undergraduate Medical Students of Al-Tibri Medical College Karachi
Approaches to Different Learning Styles in Undergraduate Medical Students of Al-Tibri Medical College Karachi
Objectives: The purpose of this study was to evaluate the different styles of learning preferred by undergraduate medical students from 1st to 5th year of Al-Tibri Medical College ...
CLASP RETENTION USING VARIABLE UNDERCUT DEPTHS
CLASP RETENTION USING VARIABLE UNDERCUT DEPTHS
Retentive force may be increased in deeper undercuts. Three clasps were examined for this hypothesis in order to analyze the retentive force change properties for each clasp design...
Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval
Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval
With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet. This trend makes ...

Back to Top