Javascript must be enabled to continue!
Deep Attention Models for Human Tracking Using RGBD
View through CrossRef
Visual tracking performance has long been limited by the lack of better appearance models. These models fail either where they tend to change rapidly, like in motion-based tracking, or where accurate information of the object may not be available, like in color camouflage (where background and foreground colors are similar). This paper proposes a robust, adaptive appearance model which works accurately in situations of color camouflage, even in the presence of complex natural objects. The proposed model includes depth as an additional feature in a hierarchical modular neural framework for online object tracking. The model adapts to the confusing appearance by identifying the stable property of depth between the target and the surrounding object(s). The depth complements the existing RGB features in scenarios when RGB features fail to adapt, hence becoming unstable over a long duration of time. The parameters of the model are learned efficiently in the Deep network, which consists of three modules: (1) The spatial attention layer, which discards the majority of the background by selecting a region containing the object of interest; (2) the appearance attention layer, which extracts appearance and spatial information about the tracked object; and (3) the state estimation layer, which enables the framework to predict future object appearance and location. Three different models were trained and tested to analyze the effect of depth along with RGB information. Also, a model is proposed to utilize only depth as a standalone input for tracking purposes. The proposed models were also evaluated in real-time using KinectV2 and showed very promising results. The results of our proposed network structures and their comparison with the state-of-the-art RGB tracking model demonstrate that adding depth significantly improves the accuracy of tracking in a more challenging environment (i.e., cluttered and camouflaged environments). Furthermore, the results of depth-based models showed that depth data can provide enough information for accurate tracking, even without RGB information.
Title: Deep Attention Models for Human Tracking Using RGBD
Description:
Visual tracking performance has long been limited by the lack of better appearance models.
These models fail either where they tend to change rapidly, like in motion-based tracking, or where accurate information of the object may not be available, like in color camouflage (where background and foreground colors are similar).
This paper proposes a robust, adaptive appearance model which works accurately in situations of color camouflage, even in the presence of complex natural objects.
The proposed model includes depth as an additional feature in a hierarchical modular neural framework for online object tracking.
The model adapts to the confusing appearance by identifying the stable property of depth between the target and the surrounding object(s).
The depth complements the existing RGB features in scenarios when RGB features fail to adapt, hence becoming unstable over a long duration of time.
The parameters of the model are learned efficiently in the Deep network, which consists of three modules: (1) The spatial attention layer, which discards the majority of the background by selecting a region containing the object of interest; (2) the appearance attention layer, which extracts appearance and spatial information about the tracked object; and (3) the state estimation layer, which enables the framework to predict future object appearance and location.
Three different models were trained and tested to analyze the effect of depth along with RGB information.
Also, a model is proposed to utilize only depth as a standalone input for tracking purposes.
The proposed models were also evaluated in real-time using KinectV2 and showed very promising results.
The results of our proposed network structures and their comparison with the state-of-the-art RGB tracking model demonstrate that adding depth significantly improves the accuracy of tracking in a more challenging environment (i.
e.
, cluttered and camouflaged environments).
Furthermore, the results of depth-based models showed that depth data can provide enough information for accurate tracking, even without RGB information.
Related Results
Is a Fitbit a Diary? Self-Tracking and Autobiography
Is a Fitbit a Diary? Self-Tracking and Autobiography
Data becomes something of a mirror in which people see themselves reflected. (Sorapure 270)In a 2014 essay for The New Yorker, the humourist David Sedaris recounts an obsession spu...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
The role of procedural learning in stuttering
The role of procedural learning in stuttering
<p>This research study examined motor control and procedural learning abilities in the oral and manual motor systems of adults who stutter, using people with Parkinson's dise...
Neural network-based algorithm for door handle recognition using RGBD cameras
Neural network-based algorithm for door handle recognition using RGBD cameras
AbstractThe ability to recognize and interact with a variety of doorknob designs is an important component on the path to true robot adaptability, allowing robotic systems to effec...
Visual tracking algorithm based on template updating and dual feature enhancement
Visual tracking algorithm based on template updating and dual feature enhancement
Aiming at the problem of tracking failure due to target deformation, flipping and occlusion in visual tracking, a template updating algorithm based on image structural similarity i...
Performance of Correlational Filtering and Deep Learning Based Single Target Tracking Algorithms
Performance of Correlational Filtering and Deep Learning Based Single Target Tracking Algorithms
Visual target tracking is an important research element in the field of computer vision. The applications are very wide. In terms of the computer vision field, deep learning has ac...
A Long-Term Video Tracking Method for Group-Housed Pigs
A Long-Term Video Tracking Method for Group-Housed Pigs
Pig tracking provides strong support for refined management in pig farms. However, long and continuous multi-pig tracking is still extremely challenging due to occlusion, distortio...

