Javascript must be enabled to continue!
PAEN: Efficient Pillar-based 3D Object Detector Based on Attention and Dilated Convolution
View through CrossRef
The Pillar-based 3D object detector can complete the scene-sensing task efficiently and quickly, meeting the basic real-time detection needs of the automatic driving sensing module. In this paper, we propose a Pillar Sequence Attention Encoder and Dilated Expansion Convolution Network. The former addresses issues of coarse encoding methods and limitations in encoding information during the pillar encoding stage, while the latter tackles the problem of insufficient receptive fields in the backbone network. Specifically, the Pillar Sequence Attention Encoder uses the Pillar Sequence Attention module (PSA) to capture attention information among points in the local region of the pillar and utilizes a Pillar Feature Soft Aggregation module (PFSA) to finely aggregate information from points within the pillar. The Dilated Expansion Convolution Network leverages dilated convolutions to capture feature information with both sparse and dense in wide-ranging receptive fields. We conducted experiments on the KITTI dataset to validate the performance of our model and the effectiveness of the proposed modules. Experiments show that our method achieved a mean average precision(mAP) of 81.48% for the car category, surpassing the baseline model by 3.12%, while the inference time only increases by about 10ms.
The Society of International Computing
Title: PAEN: Efficient Pillar-based 3D Object Detector Based on Attention and Dilated Convolution
Description:
The Pillar-based 3D object detector can complete the scene-sensing task efficiently and quickly, meeting the basic real-time detection needs of the automatic driving sensing module.
In this paper, we propose a Pillar Sequence Attention Encoder and Dilated Expansion Convolution Network.
The former addresses issues of coarse encoding methods and limitations in encoding information during the pillar encoding stage, while the latter tackles the problem of insufficient receptive fields in the backbone network.
Specifically, the Pillar Sequence Attention Encoder uses the Pillar Sequence Attention module (PSA) to capture attention information among points in the local region of the pillar and utilizes a Pillar Feature Soft Aggregation module (PFSA) to finely aggregate information from points within the pillar.
The Dilated Expansion Convolution Network leverages dilated convolutions to capture feature information with both sparse and dense in wide-ranging receptive fields.
We conducted experiments on the KITTI dataset to validate the performance of our model and the effectiveness of the proposed modules.
Experiments show that our method achieved a mean average precision(mAP) of 81.
48% for the car category, surpassing the baseline model by 3.
12%, while the inference time only increases by about 10ms.
Related Results
Research on water immersion damage characteristics and equivalent width of coal pillar
Research on water immersion damage characteristics and equivalent width of coal pillar
Abstract
Affected by weakening effect of water in the goaf, the bearing capacity of coal pillar reduced, and coal pillar rock burst is prone to occur, which is a serious th...
Characterization of a novel HgCdTe focal plane array for ground and space astronomy through innovative infrared setups
Characterization of a novel HgCdTe focal plane array for ground and space astronomy through innovative infrared setups
(English) Nowadays, mercury-cadmium-telluride (MCT) short-wave infrared (SWIR) detectors are widely used in cutting-edge space
missions and ground-based telescopes. They take adva...
Conceptual design report of the MPD Cosmic Ray Detector (MCORD)
Conceptual design report of the MPD Cosmic Ray Detector (MCORD)
Abstract
This report presents a concept of constructing a detector
dedicated for detection of muons observed during measurements
carried out at the MPD (Multi-Pu...
Lightweight Design of Patch Plate on Car-body B-pillar based on Side Impact Safety
Lightweight Design of Patch Plate on Car-body B-pillar based on Side Impact Safety
The B-pillar of automobile needs to meet the requirements of vehicle strength and rigidity, and also consider the fuel economy of vehicle. Therefore, the design and development of ...
Supercapacitive MnO2/PEDOT: PSS Modified 3D-Printed Polymeric Micro-Pillar Electrode for Extraction of Photosynthetic Electrons
Supercapacitive MnO2/PEDOT: PSS Modified 3D-Printed Polymeric Micro-Pillar Electrode for Extraction of Photosynthetic Electrons
Photosynthetic bio-electrochemical cells (PBECs) have been reported as having promising potential for the renewable energy field. When photosynthesis occurs, photosynthetic electro...
Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
Neutron holography simulation based on different sample rotations
Neutron holography simulation based on different sample rotations
Neutron holography is a new imaging technique based on the recording of the interference pattern of two coherent waves emitted by the same source, which allows observing the spatia...
Dilated convolution with learnable spacings
Dilated convolution with learnable spacings
Convolution dilatée avec espacements apprenables
Dans cette thèse, nous avons développé et étudié la méthode de convolution dilatée avec espacements apprenables (Di...

