Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
Driver assistance systems and autonomous robotics rely on several sensors for environment perception. Compared to LiDAR systems, inexpensive vision sensors capture the 3D scene as a driver perceives it, in terms of both appearance and depth cues. Giving vehicles 3D image understanding capabilities is therefore essential for inferring scene semantics in urban environments. One of the challenges of navigating naturalistic urban scenarios is detecting road participants (e.g., cyclists, pedestrians and vehicles). This paper tackles the detection and orientation estimation of cars, pedestrians and cyclists on the challenging, naturalistic KITTI images. It proposes 3D-aware features computed from stereo color images to capture the appearance and depth characteristics of objects in road scenes. The successful part-based object detector known as DPM is extended to learn richer models from 2.5D data (color and disparity), and the training pipeline is analyzed in detail. A large set of experiments evaluates the proposals, and the best-performing approach is ranked on the KITTI website. This is the first work to report results with stereo data for the KITTI object challenge, achieving higher detection rates for the car and cyclist classes than a baseline DPM.
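The abstract does not spell out how the 3D-aware features are built, so the following is only a minimal sketch of the general idea of 2.5D features: the same gradient-orientation histogram (a simplified, HOG-like descriptor without block normalization) is computed on an intensity image and on a disparity map, and the two are concatenated per cell. The function names, the 8-pixel cell size and the 9 orientation bins are assumptions made for illustration, not the paper's method.

```python
import numpy as np


def cell_orientation_histograms(channel, cell=8, bins=9):
    """Simplified HOG-like descriptor for one 2D channel (hypothetical helper).

    Returns an array of shape (rows, cols, bins) holding, for each
    cell x cell block, a histogram of unsigned gradient orientations
    weighted by gradient magnitude.
    """
    channel = np.asarray(channel, dtype=np.float64)
    gy, gx = np.gradient(channel)              # derivatives along rows, columns
    magnitude = np.hypot(gx, gy)
    angle = np.mod(np.arctan2(gy, gx), np.pi)  # unsigned orientation
    # Clamp guards against rounding that could place an angle exactly at pi.
    bin_index = np.minimum((angle / np.pi * bins).astype(int), bins - 1)

    rows, cols = channel.shape[0] // cell, channel.shape[1] // cell
    hist = np.zeros((rows, cols, bins))
    for r in range(rows):
        for c in range(cols):
            window = (slice(r * cell, (r + 1) * cell),
                      slice(c * cell, (c + 1) * cell))
            hist[r, c] = np.bincount(bin_index[window].ravel(),
                                     weights=magnitude[window].ravel(),
                                     minlength=bins)
    return hist


def features_2_5d(gray, disparity, cell=8, bins=9):
    """Concatenate appearance and disparity histograms per cell (assumed layout)."""
    appearance = cell_orientation_histograms(gray, cell, bins)
    depth = cell_orientation_histograms(disparity, cell, bins)
    return np.concatenate([appearance, depth], axis=-1)
```

For a 32x48 image pair, `features_2_5d(gray, disparity)` yields a 4x6 grid of cells with 18 values each: the first 9 describe appearance and the last 9 describe disparity. A part-based detector could then score such cells the same way it scores color-only cells.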

