Javascript must be enabled to continue!
AVS-YOLO: Object Detection in Aerial Visual Scene
View through CrossRef
Difficult object detection and class imbalance in object detection are the two main challenges faced by aerial image object detection. Difficult objects include small objects, objects of scale variation and objects with serious background interference. Class imbalances come from the number of different classes of objects and sampling of positive and negative samples. Due to these challenges, conventional object detection models usually cannot effectively detect objects in aerial images, especially in the balance between network speed and accuracy. In this paper, the YOLOv3 network structure was improved and an object detection method under the aerial visual scene (AVS-YOLO) was proposed. By introducing a type of densely connected feature pyramid strategy, a scale-aware attention module was constructed, considering both residual dense network blocks and the median-frequency-balancing mechanism. On this basis, an algorithm with ideal speed and accuracy for object detection is obtained. To verify the effectiveness of the algorithm, AVS-YOLO and YOLOv3 were both used to test the VisDrone-DET2019 and UAVDT. The experimental results show that the AP of AVS-YOLO increases by 6.22% and 5.09% on the VisDrone2019 and UAVDT datasets, respectively, compared with YOLOv3. In addition, the AP of AVS-YOLO is 1.82% higher than that of YOLOv4 on the VisDrone2019 dataset. In terms of detection speed, AVS-YOLO can process 31.8 frames per second on a single Nvidia GTX 2080Ti GPU, compared with 44.1 frames per second for YOLOv3. Compared with the other one-stage network in the field of object detection, AVS-YOLO currently achieves the state-of-the-art performance with similar calculation amount on this dataset.
Title: AVS-YOLO: Object Detection in Aerial Visual Scene
Description:
Difficult object detection and class imbalance in object detection are the two main challenges faced by aerial image object detection.
Difficult objects include small objects, objects of scale variation and objects with serious background interference.
Class imbalances come from the number of different classes of objects and sampling of positive and negative samples.
Due to these challenges, conventional object detection models usually cannot effectively detect objects in aerial images, especially in the balance between network speed and accuracy.
In this paper, the YOLOv3 network structure was improved and an object detection method under the aerial visual scene (AVS-YOLO) was proposed.
By introducing a type of densely connected feature pyramid strategy, a scale-aware attention module was constructed, considering both residual dense network blocks and the median-frequency-balancing mechanism.
On this basis, an algorithm with ideal speed and accuracy for object detection is obtained.
To verify the effectiveness of the algorithm, AVS-YOLO and YOLOv3 were both used to test the VisDrone-DET2019 and UAVDT.
The experimental results show that the AP of AVS-YOLO increases by 6.
22% and 5.
09% on the VisDrone2019 and UAVDT datasets, respectively, compared with YOLOv3.
In addition, the AP of AVS-YOLO is 1.
82% higher than that of YOLOv4 on the VisDrone2019 dataset.
In terms of detection speed, AVS-YOLO can process 31.
8 frames per second on a single Nvidia GTX 2080Ti GPU, compared with 44.
1 frames per second for YOLOv3.
Compared with the other one-stage network in the field of object detection, AVS-YOLO currently achieves the state-of-the-art performance with similar calculation amount on this dataset.
Related Results
Lightweight fruit detection algorithms for low‐power computing devices
Lightweight fruit detection algorithms for low‐power computing devices
Abstract
A lightweight fruit detection algorithm is important to ensure real‐time detection on low‐power computing devices while maintaining detection accuracy. I...
The impact of vision loss on attitudes toward autonomous vehicles: A vision-centric analysis
The impact of vision loss on attitudes toward autonomous vehicles: A vision-centric analysis
SIGNIFICANCE
Autonomous vehicles (AVs) have the promise to be an alternative transportation solution for those with vision loss. However, the impact of vision loss on t...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
TRPV4 calcium-permeable channel contributes to valve stiffening in aortic stenosis
TRPV4 calcium-permeable channel contributes to valve stiffening in aortic stenosis
AbstractAortic valve stenosis (AVS) is a progressive disease marked by fibrosis, inflammation, calcification, and stiffening of the aortic valve leaflets, leading to disrupted bloo...
Adaptive Drop Approaches to Train Spiking-YOLO Network for Traffic Flow Counting
Adaptive Drop Approaches to Train Spiking-YOLO Network for Traffic Flow Counting
Abstract
Traffic flow counting is an object detection problem. YOLO (" You Only Look Once ") is a popular object detection network. Spiking-YOLO converts the YOLO network f...
Using acid volatile sulfide–simultaneous extracted metals as an assessment tool for metal pollution risk in the upper gulf of Thailand
Using acid volatile sulfide–simultaneous extracted metals as an assessment tool for metal pollution risk in the upper gulf of Thailand
Surface sediment samples from 30 station in the upper Gulf of Thailand (U-GOT) and 16 stations at the Chao Phraya river mouth collected in August 2010 and March 2011, respectively...
SD-YOLO: A Lightweight and High-Performance Deep Model for Small and Dense Object Detection
SD-YOLO: A Lightweight and High-Performance Deep Model for Small and Dense Object Detection
Abstract
Object detection in remote sensing imagery from unmanned aerial vehicles (UAVs) is crucial yet challenging, demanding efficient algorithms for high accuracy and re...
YOLO-V2 (You Only Look Once)
YOLO-V2 (You Only Look Once)
The you-only-look-once (YOLO) v2 object detector uses a single stage object detection network. YOLO v2 is faster than other two-stage deep learning object detectors, such as region...

