AVS-YOLO: Object Detection in Aerial Visual Scene
Aerial image object detection faces two main challenges: hard-to-detect objects and class imbalance. Hard-to-detect objects include small objects, objects that vary in scale, and objects subject to serious background interference. Class imbalance arises from the uneven number of objects across classes and from the sampling of positive and negative samples. Because of these challenges, conventional object detection models usually cannot detect objects in aerial images effectively, and they struggle in particular to balance network speed and accuracy. This paper improves the YOLOv3 network structure and proposes an object detection method for aerial visual scenes (AVS-YOLO). By introducing a densely connected feature pyramid strategy, the method constructs a scale-aware attention module that takes into account both residual dense network blocks and the median-frequency-balancing mechanism. The resulting algorithm offers a favorable balance of speed and accuracy for object detection. To verify its effectiveness, AVS-YOLO and YOLOv3 were both evaluated on the VisDrone-DET2019 and UAVDT datasets. The experimental results show that, compared with YOLOv3, the AP of AVS-YOLO increases by 6.22% and 5.09% on the VisDrone2019 and UAVDT datasets, respectively. In addition, the AP of AVS-YOLO is 1.82% higher than that of YOLOv4 on the VisDrone2019 dataset. In terms of detection speed, AVS-YOLO processes 31.8 frames per second on a single Nvidia GTX 2080Ti GPU, compared with 44.1 frames per second for YOLOv3. Compared with other one-stage object detection networks of similar computational cost, AVS-YOLO achieves state-of-the-art performance on this dataset.
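The abstract names median-frequency balancing but does not spell it out. As a rough illustration only, the sketch below uses the commonly cited form of the technique, in which each class is weighted by the median class frequency divided by that class's own frequency, so rare classes receive larger weights. How AVS-YOLO actually computes and applies these weights is not stated here; the function name `median_frequency_weights` and the label counts are hypothetical.

```python
from collections import Counter
from statistics import median


def median_frequency_weights(labels):
    """Return {class: median_frequency / class_frequency}.

    Assumption: frequencies are plain instance counts divided by the
    total number of instances. The paper may define them differently.
    """
    counts = Counter(labels)
    total = sum(counts.values())
    freqs = {cls: n / total for cls, n in counts.items()}
    med = median(freqs.values())
    # Rare classes (frequency below the median) get weights above 1.
    return {cls: med / f for cls, f in freqs.items()}


# Hypothetical label counts for illustration; not taken from VisDrone or UAVDT.
labels = ["car"] * 6 + ["pedestrian"] * 3 + ["bicycle"] * 1
weights = median_frequency_weights(labels)

for cls, w in sorted(weights.items()):
    print(f"{cls}: {w:.2f}")
```

With these made-up counts, the dominant class is down-weighted and the rarest class is up-weighted. Such weights would typically scale the per-class term of a classification loss.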
Related Results
Lightweight fruit detection algorithms for low‐power computing devices
A lightweight fruit detection algorithm is important to ensure real‐time detection on low‐power computing devices while maintaining detection accuracy. I...
Transient receptor potential vanilloid 4 calcium channel contributes to valve stiffening in aortic stenosis
Aortic valve stenosis (AVS) is a progressive disease characterized by fibrosis, inflammation, calcification, and stiffening of the aortic valve leaflets, which leads to impaired bl...
Application of YOLO-v7 and YOLO-v8 Transfer Learning Models in Breast Lesion Classification and Diagnosis
Background: Early detection of breast cancer and accurate assessment of lesions are key goals of imaging evaluation. Ultrasound is widely used, but its diagnost...
Modelling Agrivoltaics in a climate perspective for water-energy-food nexus analysis
Renewable energies (REs) are increasingly important in addressing the challenge of climate change. Their development and widespread use can significantly reduce greenhouse gas emis...
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
TRPV4 calcium-permeable channel contributes to valve stiffening in aortic stenosis
Aortic valve stenosis (AVS) is a progressive disease marked by fibrosis, inflammation, calcification, and stiffening of the aortic valve leaflets, leading ...
Perceptions of Autonomous Vehicles: A Case Study of Jordan
Technologies for automated driving have advanced rapidly in recent years. Autonomous Vehicles (AVs) are one example of these recent technologies that deploy elements such as sensor...
Yolo Versions Architecture: Review
Deep learning techniques are used across a wide range of fields for several applications. In recent years, deep learning-based object detection from aerial or terr...

