Javascript must be enabled to continue!
Monocular Depth Estimation for Vehicles with mounted camera in Mixed Traffic conditions
View through CrossRef
Abstract
Depth estimation is crucial for computer vision applications like autonomous driving. While traditional methods such as LiDAR and radar are expensive, making monocular depth estimation a more cost-efficient alternative. However, deriving accurate depth from a single image is challenging due to its under-constrained nature. Monocular cues like perspective, scaling, and occlusion aid human depth perception, which deep learning-based models leverage to map image features to depth values. This research addresses the complexities of monocular depth estimation in mixed traffic conditions commonly found on Indian roads, with diverse vehicle classes, road surfaces, and unpredictable obstacles. Traditional methods often struggle in these scenarios. To overcome this, our study integrates object detection with deep learning models to estimate vehicle distances from frontal camera views. Validated using dashcam and drone footage, the proposed approach achieves an RMSE below 4 meters for both training and testing datasets. Moreover, the ensemble models reduced RMSE by up to 60% and improved the \(\textnormal{R}^\textnormal{2}\) value by 40%. This solution significantly enhances the spatial awareness of autonomous vehicles, providing a robust means of navigating heterogeneous traffic environments.
Springer Science and Business Media LLC
Title: Monocular Depth Estimation for Vehicles with
mounted camera in Mixed Traffic conditions
Description:
Abstract
Depth estimation is crucial for computer vision applications like autonomous driving.
While traditional methods such as LiDAR and radar are expensive, making monocular depth estimation a more cost-efficient alternative.
However, deriving accurate depth from a single image is challenging due to its under-constrained nature.
Monocular cues like perspective, scaling, and occlusion aid human depth perception, which deep learning-based models leverage to map image features to depth values.
This research addresses the complexities of monocular depth estimation in mixed traffic conditions commonly found on Indian roads, with diverse vehicle classes, road surfaces, and unpredictable obstacles.
Traditional methods often struggle in these scenarios.
To overcome this, our study integrates object detection with deep learning models to estimate vehicle distances from frontal camera views.
Validated using dashcam and drone footage, the proposed approach achieves an RMSE below 4 meters for both training and testing datasets.
Moreover, the ensemble models reduced RMSE by up to 60% and improved the \(\textnormal{R}^\textnormal{2}\) value by 40%.
This solution significantly enhances the spatial awareness of autonomous vehicles, providing a robust means of navigating heterogeneous traffic environments.
Related Results
Monocular Depth Estimation (Literature Review)
Monocular Depth Estimation (Literature Review)
Background. The physiological basis of spatial perception is traditionally attributed to the binocular system, which integrates the signals coming to the brain from each eye into a...
Smart Traffic Control Using Computer Vision
Smart Traffic Control Using Computer Vision
A Smart Traffic Control System using Computer Vision utilizes cameras, image processing techniques, and machine learning algorithms to monitor, analyze, and manage traffic flow aut...
Autonomous Vehicles in Mixed Traffic Conditions—A Bibliometric Analysis
Autonomous Vehicles in Mixed Traffic Conditions—A Bibliometric Analysis
Autonomous Vehicles (AVs) with their immaculate sensing and navigating capabilities are expected to revolutionize urban mobility. Despite the expected benefits, this emerging techn...
Pose estimation with event camera
Pose estimation with event camera
Estimation de la pose avec une caméra évènementielle
La pose de la caméra est utilisée pour décrire la position et l'orientation d'une caméra dans un système de coo...
Machine learning techniques for forensic camera model identification and anti-forensic attacks
Machine learning techniques for forensic camera model identification and anti-forensic attacks
The goal of camera model identification is to determine the manufacturer and model of an image's source camera. Camera model identification is an important task in multimedia foren...
Monocular Vision-Based Obstacle Height Estimation for Mobile Robot
Monocular Vision-Based Obstacle Height Estimation for Mobile Robot
To ensure that robots can operate reliably in diverse environments, obstacle detection is essential, which requires the acquisition of depth information of the surrounding environm...
Traffic safety outcomes of traffic law application and the adoption of new technology in traffic control
Traffic safety outcomes of traffic law application and the adoption of new technology in traffic control
Experience of the State of Qatar
Introduction:
Since the second half of the last decade of the twentieth century, Qatar has witnessed the
implementation of a comprehensive developm...
OPTIMAL TRAFFIC CONTROL SYSTEM FOR TRAFFIC CONGESTION
OPTIMAL TRAFFIC CONTROL SYSTEM FOR TRAFFIC CONGESTION
A smart city's traffic management system is regarded as one of its primary components. Traffic jams are a common sight on the roadways in metropolitan areas due to the rapid increa...

