Javascript must be enabled to continue!
A contrastive adversarial encoder for multi-omics data integration
View through CrossRef
Early and accurate cancer detection is crucial for effective treatment, prognosis, and the advancement of precision medicine. Analyzing omics data is vital in cancer research. While using a single type of omics data provides a limited perspective, integrating multiple omics modalities allows for a more comprehensive understanding of cancer. Current deep models struggle to achieve efficient dimensionality reduction while preserving global information and integrating multi-omics data. This often results in feature redundancy or information loss, overlooking the synergies among different modalities. This paper proposes a contrastive adversarial encoder (CAEncoder) for multi-omics data integration to address this challenge. The proposed model combines a Vision Transformer (ViT) and a CycleGAN, trained in an end-to-end contrastive manner. The ViT is the encoder, utilizing self-attention, while the CycleGAN employs adversarial learning to ensure more discriminative and invariant latent space embeddings. Contrastive adversarial training improves representation quality by preventing information loss, eliminating redundancy, and capturing the synergies among different omics modalities. To ensure contrastive adversarial training, a composite loss function is used, consisting of a weighted combination of Adversarial Loss (Hinge Loss), Cycle Consistency Loss, and Triplet Margin Loss. The Adversarial Loss and Cycle Consistency Loss provide feedback from the CycleGAN, ensuring effective adversarial learning. Meanwhile, the Triplet Margin Loss promotes contrastive learning by pulling similar samples together and pushing dissimilar samples apart in the latent space. The performance of the CAEncoder is evaluated on downstream classification tasks, including both binary and multi-class classifications of five different cancer types. The results show that the model achieved a classification accuracy of up to 93.33% and an F1 score of 92.81%, outperforming existing advanced models. These findings demonstrate the potential of our method to enhance precision medicine for cancer through improved multi-omics data integration.
Public Library of Science (PLoS)
Title: A contrastive adversarial encoder for multi-omics data integration
Description:
Early and accurate cancer detection is crucial for effective treatment, prognosis, and the advancement of precision medicine.
Analyzing omics data is vital in cancer research.
While using a single type of omics data provides a limited perspective, integrating multiple omics modalities allows for a more comprehensive understanding of cancer.
Current deep models struggle to achieve efficient dimensionality reduction while preserving global information and integrating multi-omics data.
This often results in feature redundancy or information loss, overlooking the synergies among different modalities.
This paper proposes a contrastive adversarial encoder (CAEncoder) for multi-omics data integration to address this challenge.
The proposed model combines a Vision Transformer (ViT) and a CycleGAN, trained in an end-to-end contrastive manner.
The ViT is the encoder, utilizing self-attention, while the CycleGAN employs adversarial learning to ensure more discriminative and invariant latent space embeddings.
Contrastive adversarial training improves representation quality by preventing information loss, eliminating redundancy, and capturing the synergies among different omics modalities.
To ensure contrastive adversarial training, a composite loss function is used, consisting of a weighted combination of Adversarial Loss (Hinge Loss), Cycle Consistency Loss, and Triplet Margin Loss.
The Adversarial Loss and Cycle Consistency Loss provide feedback from the CycleGAN, ensuring effective adversarial learning.
Meanwhile, the Triplet Margin Loss promotes contrastive learning by pulling similar samples together and pushing dissimilar samples apart in the latent space.
The performance of the CAEncoder is evaluated on downstream classification tasks, including both binary and multi-class classifications of five different cancer types.
The results show that the model achieved a classification accuracy of up to 93.
33% and an F1 score of 92.
81%, outperforming existing advanced models.
These findings demonstrate the potential of our method to enhance precision medicine for cancer through improved multi-omics data integration.
Related Results
Benchmarking multi-omics integrative clustering methods for subtype identification in colorectal cancer
Benchmarking multi-omics integrative clustering methods for subtype identification in colorectal cancer
Abstract
Background and objectives
Colorectal cancer (CRC) represents a heterogeneous malignancy that has concerned global burden of incidence and mortality. The tradition...
Multi-omics Data Integration by Generative Adversarial Network
Multi-omics Data Integration by Generative Adversarial Network
Accurate disease phenotype prediction plays an important role in the treatment of heterogeneous diseases like cancer in the era of precision medicine. With the advent of high throu...
Improving Diversity and Quality of Adversarial Examples in Adversarial Transformation Network
Improving Diversity and Quality of Adversarial Examples in Adversarial Transformation Network
Abstract
This paper proposes a method to mitigate two major issues of Adversarial Transformation Networks (ATN) including the low diversity and the low quality of adversari...
Muon: multimodal omics analysis framework
Muon: multimodal omics analysis framework
AbstractAdvances in multi-omics technologies have led to an explosion of multimodal datasets to address questions ranging from basic biology to translation. While these rich data p...
MD2PR: A Multi-level Distillation based Dense Passage Retrieval Model
MD2PR: A Multi-level Distillation based Dense Passage Retrieval Model
Abstract
Reranker and retriever are two important components in information retrieval. The retriever typically adopts a dual-encoder model, where queries and docume...
Exploring the classification of cancer cell lines from multiple omic views
Exploring the classification of cancer cell lines from multiple omic views
Background
Cancer classification is of great importance to understanding its pathogenesis, making diagnosis and developing treatment. The accumulation of extensive o...
MOPower: an R-shiny application for the simulation and power calculation of multi-omics studies
MOPower: an R-shiny application for the simulation and power calculation of multi-omics studies
Abstract
Background
Multi-omics studies are increasingly used to help understand the underlying mechanisms of clinical phenotyp...
Development of a combined magnetic encoder
Development of a combined magnetic encoder
Purpose
As a type of angular displacement sensor, the Hall-effect magnetic encoder incorporates many advantages. While compared with the photoelectric encoder, the magnetic encoder...

