PAEN: Efficient Pillar-based 3D Object Detector Based on Attention and Dilated Convolution

Pillar-based 3D object detectors perform scene perception quickly and efficiently, meeting the basic real-time detection requirements of the perception module in autonomous driving. In this paper, we propose a Pillar Sequence Attention Encoder and a Dilated Expansion Convolution Network. The former addresses the coarse encoding and the limited encoded information of the pillar encoding stage, while the latter addresses the insufficient receptive field of the backbone network. Specifically, the Pillar Sequence Attention Encoder uses a Pillar Sequence Attention (PSA) module to capture attention information among the points in each pillar's local region, and a Pillar Feature Soft Aggregation (PFSA) module to aggregate information from the points within the pillar at a fine granularity. The Dilated Expansion Convolution Network uses dilated convolutions to capture both sparse and dense feature information over wide receptive fields. We conducted experiments on the KITTI dataset to validate the performance of our model and the effectiveness of the proposed modules. The experiments show that our method achieves a mean average precision (mAP) of 81.48% for the car category, surpassing the baseline model by 3.12%, while inference time increases by only about 10 ms.
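The point about receptive fields can be illustrated with a minimal sketch. It is not the paper's Dilated Expansion Convolution Network: the function names `receptive_field` and `dilated_conv1d`, the kernel size of 3, and the dilation rates are all illustrative assumptions. It only shows the general property that stacking stride-1 convolutions with growing dilation widens the receptive field without adding weights.

```python
def receptive_field(kernel_size, dilations):
    """Per-axis receptive field of stacked stride-1 convolutions.

    Each layer with dilation d widens the field by (kernel_size - 1) * d.
    """
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


def dilated_conv1d(signal, kernel, dilation):
    """Valid-mode 1D dilated cross-correlation in pure Python (illustrative)."""
    # Distance between the first and last kernel tap on the input.
    span = (len(kernel) - 1) * dilation
    return [
        sum(kernel[j] * signal[i + j * dilation] for j in range(len(kernel)))
        for i in range(len(signal) - span)
    ]


# Three 3-tap layers: dense (dilation 1 everywhere) vs. growing dilation.
dense_rf = receptive_field(3, [1, 1, 1])
dilated_rf = receptive_field(3, [1, 2, 4])
print(dense_rf, dilated_rf)

# A single 3-tap kernel with dilation 2 reads inputs i, i+2, i+4.
print(dilated_conv1d([1, 2, 3, 4, 5, 6, 7], [1, 0, -1], 2))
```

With the same number of weights, the dilated stack covers roughly twice as many input positions per axis as the dense one, which is the kind of trade-off the abstract alludes to when it mentions sparse and dense features over wide receptive fields.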

The Society of International Computing

Zhang Guanghao

Poster Volume Ⅰ The 2024 Twentieth International Conference on Intelligent Computing August 5-8, 2024 Tianjin, China

2025

Affected by the weakening effect of water in the goaf, the bearing capacity of the coal pillar is reduced, and coal pillar rock burst is prone to occur, which is a serious th...

Characterization of a novel HgCdTe focal plane array for ground and space astronomy through innovative infrared setups

Nowadays, mercury-cadmium-telluride (MCT) short-wave infrared (SWIR) detectors are widely used in cutting-edge space missions and ground-based telescopes. They take adva...

Conceptual design report of the MPD Cosmic Ray Detector (MCORD)

This report presents a concept for constructing a detector dedicated to the detection of muons observed during measurements carried out at the MPD (Multi-Pu...

Lightweight Design of Patch Plate on Car-body B-pillar based on Side Impact Safety

The B-pillar of an automobile must meet the requirements of vehicle strength and rigidity while also taking the vehicle's fuel economy into account. Therefore, the design and development of ...

Supercapacitive MnO2/PEDOT:PSS Modified 3D-Printed Polymeric Micro-Pillar Electrode for Extraction of Photosynthetic Electrons

Photosynthetic bio-electrochemical cells (PBECs) have been reported as having promising potential for the renewable energy field. When photosynthesis occurs, photosynthetic electro...

Graph convolutional neural networks for 3D data analysis

Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...

Neutron holography simulation based on different sample rotations

Neutron holography is a new imaging technique based on the recording of the interference pattern of two coherent waves emitted by the same source, which allows observing the spatia...

Dilated convolution with learnable spacings

In this thesis, we developed and studied the dilated convolution with learnable spacings method (Di...

Related Results