Self-Supervised Contrastive Representation Learning in Computer Vision

Although its origins date back a few decades, contrastive learning has recently gained popularity due to its achievements in self-supervised learning, especially in computer vision. Supervised learning usually requires a sizable amount of labeled data, which is not easy to obtain for many applications. With self-supervised learning, we can use inexpensive unlabeled data and train a model on a pretext task. Such training helps us learn powerful representations. In most cases, for a downstream task, the self-supervised model is then fine-tuned with whatever labeled data is available. In this study, we review common pretext and downstream tasks in computer vision and present the latest self-supervised contrastive learning techniques, which are implemented as Siamese neural networks. Lastly, we present a case study in which self-supervised contrastive learning was applied to learn representations of semantic masks of images. Performance was evaluated on an image retrieval task, and the results reveal that, in accordance with findings in the literature, fine-tuning the self-supervised model gave the best performance.
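
The abstract does not say which contrastive objective the chapter uses. As an illustration only, the sketch below computes the NT-Xent loss (the objective popularized by SimCLR) on the outputs of the two branches of a Siamese network. The function name `nt_xent_loss`, the `temperature` default, and the use of plain NumPy arrays in place of real network outputs are all assumptions made for this example, not details taken from the chapter.

```python
import numpy as np


def nt_xent_loss(z1, z2, temperature=0.5):
    """NT-Xent loss for N paired embeddings (hypothetical helper, not from the chapter).

    z1[i] and z2[i] are embeddings of two augmented views of the same image,
    i.e. the outputs of the two Siamese branches. Every other embedding in the
    batch is treated as a negative. Assumes no embedding is the zero vector.
    """
    n = z1.shape[0]
    z = np.concatenate([z1, z2], axis=0)               # shape (2N, D)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)   # unit length, so dot product = cosine
    sim = (z @ z.T) / temperature                      # pairwise similarities, shape (2N, 2N)
    np.fill_diagonal(sim, -np.inf)                     # a sample is never its own negative

    # For row i in the first half the positive sits at i + N, and vice versa.
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])

    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - row_max - np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True))

    # Cross-entropy with the positive pair as the target class.
    return -log_prob[np.arange(2 * n), pos].mean()
```

In a pretext-training loop, `z1` and `z2` would come from passing two augmentations of each unlabeled image through the shared encoder; minimizing this value pulls the two views of an image together and pushes views of different images apart.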

IntechOpen

Yalin Bastanlar, Semih Orhan

Artificial Intelligence

2022
