Javascript must be enabled to continue!
AFE-RCNN: Adaptive Feature Enhancement RCNN for 3D Object Detection
View through CrossRef
The point clouds scanned by lidar are generally sparse, which can result in fewer sampling points of objects. To perform precise and effective 3D object detection, it is necessary to improve the feature representation ability to extract more feature information of the object points. Therefore, we propose an adaptive feature enhanced 3D object detection network based on point clouds (AFE-RCNN). AFE-RCNN is a point-voxel integrated network. We first voxelize the raw point clouds and obtain the voxel features through the 3D voxel convolutional neural network. Then, the 3D feature vectors are projected to the 2D bird’s eye view (BEV), and the relationship between the features in both spatial dimension and channel dimension is learned by the proposed residual of dual attention proposal generation module. The high-quality 3D box proposals are generated based on the BEV features and anchor-based approach. Next, we sample key points from raw point clouds to summarize the information of the voxel features, and obtain the key point features by the multi-scale feature extraction module based on adaptive feature adjustment. The neighboring contextual information is integrated into each key point through this module, and the robustness of feature processing is also guaranteed. Lastly, we aggregate the features of the BEV, voxels, and point clouds as the key point features that are used for proposal refinement. In addition, to ensure the correlation among the vertices of the bounding box, we propose a refinement loss function module with vertex associativity. Our AFE-RCNN exhibits comparable performance on the KITTI dataset and Waymo open dataset to state-of-the-art methods. On the KITTI 3D detection benchmark, for the moderate difficulty level of the car and the cyclist classes, the 3D detection mean average precisions of AFE-RCNN can reach 81.53% and 67.50%, respectively.
Title: AFE-RCNN: Adaptive Feature Enhancement RCNN for 3D Object Detection
Description:
The point clouds scanned by lidar are generally sparse, which can result in fewer sampling points of objects.
To perform precise and effective 3D object detection, it is necessary to improve the feature representation ability to extract more feature information of the object points.
Therefore, we propose an adaptive feature enhanced 3D object detection network based on point clouds (AFE-RCNN).
AFE-RCNN is a point-voxel integrated network.
We first voxelize the raw point clouds and obtain the voxel features through the 3D voxel convolutional neural network.
Then, the 3D feature vectors are projected to the 2D bird’s eye view (BEV), and the relationship between the features in both spatial dimension and channel dimension is learned by the proposed residual of dual attention proposal generation module.
The high-quality 3D box proposals are generated based on the BEV features and anchor-based approach.
Next, we sample key points from raw point clouds to summarize the information of the voxel features, and obtain the key point features by the multi-scale feature extraction module based on adaptive feature adjustment.
The neighboring contextual information is integrated into each key point through this module, and the robustness of feature processing is also guaranteed.
Lastly, we aggregate the features of the BEV, voxels, and point clouds as the key point features that are used for proposal refinement.
In addition, to ensure the correlation among the vertices of the bounding box, we propose a refinement loss function module with vertex associativity.
Our AFE-RCNN exhibits comparable performance on the KITTI dataset and Waymo open dataset to state-of-the-art methods.
On the KITTI 3D detection benchmark, for the moderate difficulty level of the car and the cyclist classes, the 3D detection mean average precisions of AFE-RCNN can reach 81.
53% and 67.
50%, respectively.
Related Results
[RETRACTED] Rhino XL Male Enhancement v1
[RETRACTED] Rhino XL Male Enhancement v1
[RETRACTED]Rhino XL Reviews, NY USA: Studies show that testosterone levels in males decrease constantly with growing age. There are also many other problems that males face due ...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Aeroengine Blade Surface Defect Detection System Based on Improved Faster RCNN
Aeroengine Blade Surface Defect Detection System Based on Improved Faster RCNN
Aiming at the difficulty of automatic blade detection and the discontinuous defects on the full image, an aeroengine blade surface defect detection system based on improved faster ...
Amniotic fluid embolism: A case-series
Amniotic fluid embolism: A case-series
Amniotic fluid embolism (AFE) is a rare but potentially catastrophic pregnancy complication. This is a 10-year retrospective study on women with AFE from 2014 to 2023. Cases that m...
Amniotic fluid embolism: a reappraisal
Amniotic fluid embolism: a reappraisal
Abstract
Objectives
Using cases from our own experience and from the published literature on amniotic fluid embolism (AFE), we s...
Cosmic ray muon clustering for the MicroBooNE liquid argon time projection chamber using sMask-RCNN
Cosmic ray muon clustering for the MicroBooNE liquid argon time projection chamber using sMask-RCNN
Abstract
In this article, we describe a modified implementation of Mask Region-based Convolutional Neural Networks (Mask-RCNN) for cosmic ray muon clustering in a li...
Deep learning for small object detection in images
Deep learning for small object detection in images
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT REQUEST OF AUTHOR.] With the rapid development of deep learning in computer vision, especially deep convolutional neural network...
REAL-TIME OBJECT DETECTION MODEL USING YOLOv10
REAL-TIME OBJECT DETECTION MODEL USING YOLOv10
From security systems to driverless cars, object detection is essential to many applications. The main goal of this project is to use YOLOv10 and RCNN (Region-Convolutional Neural ...

