Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Object Tracking in RGB-T Videos Using Modal-Aware Attention Network and Competitive Learning

View through CrossRef
Object tracking in RGB-thermal (RGB-T) videos is increasingly used in many fields due to the all-weather and all-day working capability of the dual-modality imaging system, as well as the rapid development of low-cost and miniaturized infrared camera technology. However, it is still very challenging to effectively fuse dual-modality information to build a robust RGB-T tracker. In this paper, an RGB-T object tracking algorithm based on a modal-aware attention network and competitive learning (MaCNet) is proposed, which includes a feature extraction network, modal-aware attention network, and classification network. The feature extraction network adopts the form of a two-stream network to extract features from each modality image. The modal-aware attention network integrates the original data, establishes an attention model that characterizes the importance of different feature layers, and then guides the feature fusion to enhance the information interaction between modalities. The classification network constructs a modality-egoistic loss function through three parallel binary classifiers acting on the RGB branch, the thermal infrared branch, and the fusion branch, respectively. Guided by the training strategy of competitive learning, the entire network is fine-tuned in the direction of the optimal fusion of the dual modalities. Extensive experiments on several publicly available RGB-T datasets show that our tracker has superior performance compared to other latest RGB-T and RGB tracking approaches.
Title: Object Tracking in RGB-T Videos Using Modal-Aware Attention Network and Competitive Learning
Description:
Object tracking in RGB-thermal (RGB-T) videos is increasingly used in many fields due to the all-weather and all-day working capability of the dual-modality imaging system, as well as the rapid development of low-cost and miniaturized infrared camera technology.
However, it is still very challenging to effectively fuse dual-modality information to build a robust RGB-T tracker.
In this paper, an RGB-T object tracking algorithm based on a modal-aware attention network and competitive learning (MaCNet) is proposed, which includes a feature extraction network, modal-aware attention network, and classification network.
The feature extraction network adopts the form of a two-stream network to extract features from each modality image.
The modal-aware attention network integrates the original data, establishes an attention model that characterizes the importance of different feature layers, and then guides the feature fusion to enhance the information interaction between modalities.
The classification network constructs a modality-egoistic loss function through three parallel binary classifiers acting on the RGB branch, the thermal infrared branch, and the fusion branch, respectively.
Guided by the training strategy of competitive learning, the entire network is fine-tuned in the direction of the optimal fusion of the dual modalities.
Extensive experiments on several publicly available RGB-T datasets show that our tracker has superior performance compared to other latest RGB-T and RGB tracking approaches.

Related Results

Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Short video platforms as sources of health information about cervical cancer: A content and quality analysis
Short video platforms as sources of health information about cervical cancer: A content and quality analysis
BackgroundThe development of short popular science video platforms helps people obtain health information, but no research has evaluated the information characteristics and quality...
Born To Die: Lana Del Rey, Beauty Queen or Gothic Princess?
Born To Die: Lana Del Rey, Beauty Queen or Gothic Princess?
Closer examination of contemporary art forms including music videos in addition to the Gothic’s literature legacy is essential, “as it is virtually impossible to ignore the relatio...
Contour Tracking
Contour Tracking
Abstract Object tracking is a fundamental problem in computer vision. It is generally required as a preprocessing step that is used to perform motion‐based object recogni...
Approaches to Different Learning Styles in Undergraduate Medical Students of Al-Tibri Medical College Karachi
Approaches to Different Learning Styles in Undergraduate Medical Students of Al-Tibri Medical College Karachi
Objectives: The purpose of this study was to evaluate the different styles of learning preferred by undergraduate medical students from 1st to 5th year of Al-Tibri Medical College ...
Promoting Mask Use on TikTok: Descriptive, Cross-sectional Study (Preprint)
Promoting Mask Use on TikTok: Descriptive, Cross-sectional Study (Preprint)
BACKGROUND Over the past decade, there has been an increasing secular trend in the number of studies on social media and health. ...
The Scope of Nonsuicidal Self-Injury on YouTube
The Scope of Nonsuicidal Self-Injury on YouTube
OBJECTIVE: Nonsuicidal self-injury, the deliberate destruction of one's body tissue (eg, self-cutting, burning) without suicidal intent, has consistent rates rang...

Back to Top