AVS-YOLO: Object Detection in Aerial Visual Scene

Aerial image object detection faces two main challenges: hard-to-detect objects and class imbalance. Hard-to-detect objects include small objects, objects with large scale variation, and objects subject to severe background interference. Class imbalance arises both from the unequal number of objects across classes and from the sampling of positive and negative samples. Because of these challenges, conventional object detection models often fail to detect objects in aerial images effectively, particularly when network speed must be balanced against accuracy. This paper improves the YOLOv3 network structure and proposes an object detection method for the aerial visual scene (AVS-YOLO). By introducing a densely connected feature pyramid strategy, a scale-aware attention module is constructed that takes into account both residual dense network blocks and the median-frequency-balancing mechanism. On this basis, an algorithm with a favorable trade-off between speed and accuracy is obtained. To verify the effectiveness of the algorithm, AVS-YOLO and YOLOv3 were both evaluated on the VisDrone-DET2019 and UAVDT datasets. The experimental results show that, compared with YOLOv3, the AP of AVS-YOLO is 6.22% and 5.09% higher on the VisDrone2019 and UAVDT datasets, respectively. In addition, the AP of AVS-YOLO is 1.82% higher than that of YOLOv4 on the VisDrone2019 dataset. In terms of detection speed, AVS-YOLO processes 31.8 frames per second on a single Nvidia GTX 2080Ti GPU, compared with 44.1 frames per second for YOLOv3. Compared with other one-stage object detection networks of similar computational cost, AVS-YOLO achieves state-of-the-art performance on this dataset.
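The abstract names a median-frequency-balancing mechanism for class imbalance but does not give its form. As a rough illustration only, the sketch below uses one common formulation of median frequency balancing, in which each class weight is the median class frequency divided by that class's frequency. Counting object instances per image (rather than pixels) is an assumption made here for the detection setting, and the function name `median_frequency_weights` and the toy labels are hypothetical. This is not the paper's implementation.

```python
from collections import Counter
from statistics import median


def median_frequency_weights(labels_per_image):
    """Return a per-class loss weight: median(freq) / freq(class).

    labels_per_image: list of lists, one inner list of class labels per image.
    freq(class) is taken as (instances of the class) divided by
    (total instances in the images where the class appears).
    Using instance counts instead of pixel counts is an assumption.
    """
    class_count = Counter()          # instances of each class over the dataset
    total_where_present = Counter()  # instances in images containing the class
    for labels in labels_per_image:
        for cls, n in Counter(labels).items():
            class_count[cls] += n
            total_where_present[cls] += len(labels)

    freq = {c: class_count[c] / total_where_present[c] for c in class_count}
    med = median(freq.values())
    # Rare classes get weights above 1, frequent classes below 1.
    return {c: med / f for c, f in freq.items()}


# Toy dataset (hypothetical): cars dominate, trucks are rare.
example_labels = [
    ["car"] * 8 + ["person"] * 2,
    ["car"] * 6 + ["person"] * 3 + ["truck"],
]
weights = median_frequency_weights(example_labels)
for cls, w in sorted(weights.items()):
    print(f"{cls}: {w:.3f}")
```

Weights of this kind would typically scale the per-class classification loss so that rare classes contribute more to the gradient; how AVS-YOLO applies them is not stated in the abstract.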

World Scientific Pub Co Pte Ltd

You Ma, Lin Chai, Lizuo Jin, Yafeng Yu, Jun Yan

International Journal of Pattern Recognition and Artificial Intelligence

2022
