Javascript must be enabled to continue!

Real-Time Object Detection using an Ensemble of One Stage and Two Stage Object Detection Models with Dynamic Fine-tuning using Kullback-Leibler Divergence

Real-time object detection is a very challenging task, as it requires both high accuracy and high speed. One-stage object detectors such as YOLO models are very fast but they are also less accurate than two-stage object detectors such as Faster R-CNN. However, Faster R-CNN is not as fast as the YOLO models. In this study, we propose an ensemble approach to real-time object detection that combines the strengths of YOLOv5 and Faster R-CNN. We first use YOLOv5 to quickly generate a set of object proposals. We then use Faster R-CNN to refine these proposals and produce more accurate object detection results. To further improve the accuracy of our object detection results, we propose a cascade refinement network that uses dynamic fine-tuning. The cascade refinement network uses Kullback-Leibler divergence to dynamically adjust the weights of the Faster R-CNN model based on the confidence scores of the YOLOv5 object proposals. We evaluated our proposed approach on the novel dataset collected in Uganda with other State-of-the-art approaches which include RetinaNet, Cascade R-CNN, Single-Shot MultiBox Detector (SSD), and Region-based Convolutional Neural Network (R-CNN). Experimental results revealed that the proposed ensemble model outperformed both base models with an average precision of 0.96, which is significantly higher than the average precision of 0.91 for YOLOv5 and 0.90 for Faster R-CNN. The ensemble model was also able to achieve real-time inference speeds, processing frames at a rate of 25 frames per second, the same speed achieved by YOLOv5, faster than the speed of 15 frames per second by Faster R-CNN. The results also revealed that the proposed ensemble model is comparable to other state-of-the-art object detection models. Our proposed approach can be used to improve the accuracy and speed of real-time object detection in a variety of applications.

Wiley

Lenard Nkalubo Rose Nakibuule Nixson Okila

2023

Title: Real-Time Object Detection using an Ensemble of One Stage and Two Stage Object Detection Models with Dynamic Fine-tuning using Kullback-Leibler Divergence

Description:

Real-time object detection is a very challenging task, as it requires both high accuracy and high speed.

One-stage object detectors such as YOLO models are very fast but they are also less accurate than two-stage object detectors such as Faster R-CNN.

However, Faster R-CNN is not as fast as the YOLO models.

In this study, we propose an ensemble approach to real-time object detection that combines the strengths of YOLOv5 and Faster R-CNN.

We first use YOLOv5 to quickly generate a set of object proposals.

We then use Faster R-CNN to refine these proposals and produce more accurate object detection results.

To further improve the accuracy of our object detection results, we propose a cascade refinement network that uses dynamic fine-tuning.

The cascade refinement network uses Kullback-Leibler divergence to dynamically adjust the weights of the Faster R-CNN model based on the confidence scores of the YOLOv5 object proposals.

We evaluated our proposed approach on the novel dataset collected in Uganda with other State-of-the-art approaches which include RetinaNet, Cascade R-CNN, Single-Shot MultiBox Detector (SSD), and Region-based Convolutional Neural Network (R-CNN).

Experimental results revealed that the proposed ensemble model outperformed both base models with an average precision of 0.

96, which is significantly higher than the average precision of 0.

91 for YOLOv5 and 0.

90 for Faster R-CNN.

The ensemble model was also able to achieve real-time inference speeds, processing frames at a rate of 25 frames per second, the same speed achieved by YOLOv5, faster than the speed of 15 frames per second by Faster R-CNN.

The results also revealed that the proposed ensemble model is comparable to other state-of-the-art object detection models.

Our proposed approach can be used to improve the accuracy and speed of real-time object detection in a variety of applications.

Back

By calculating the Kullback–Leibler divergence between two probability measures belonging to different exponential families dominated by the same measure, we obtain a formula that ...

Utilizing Amari-Alpha Divergence to Stabilize the Training of Generative Adversarial Networks

Generative Adversarial Nets (GANs) are one of the most popular architectures for image generation, which has achieved significant progress in generating high-resolution, diverse im...

Sensory Evaluation of Odor Approximation Using NMF with Kullback-Leibler Divergence and Itakura-Saito Divergence in Mass Spectrum Space

The odor reproduction can be achieved by approximating mass spectra of different odors by blending a set of odor components. The method enables us to create various odors by adjust...

Adaptive Multi-source Domain Collaborative Fine-tuning for Transfer Learning

Fine-tuning is an important technique in transfer learning that has achieved significant success in tasks that lack training data. However, as it is difficult to extract effective ...

Revisiting Fine-Tuning: A Survey of Parameter-Efficient Techniques for Large AI Models

Foundation models have revolutionized artificial intelligence by achieving state-of-the-art performance across a wide range of tasks. However, fine-tuning these massive models for ...

KL-MOB: Automated Covid-19 Recognition Using a Novel Approach Based on Image Enhancement and a Modified MobileNet CNN

ABSTRACT The emergence of the novel coronavirus pneumonia (Covid-19) pandemic at the end of 2019 led to chaos worldwide. The world breathed a sigh of relief when so...

Depth-aware salient object segmentation

Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...

Families of Generalized Quasisymmetry Models: A ϕ-Divergence Approach

The quasisymmetry (QS) model for square contingency tables is revisited, highlighting properties and features on the basis of its alternative definitions. More parsimonious QS-type...

Email:
Password:

Email:

Real-Time Object Detection using an Ensemble of One Stage and Two Stage Object Detection Models with Dynamic Fine-tuning using Kullback-Leibler Divergence

Related Results