Real-Time Object Detection using an Ensemble of One Stage and Two Stage Object Detection Models with Dynamic Fine-tuning using Kullback-Leibler Divergence
Real-time object detection is challenging because it requires both high
accuracy and high speed. One-stage object detectors such as the YOLO
models are very fast but less accurate than two-stage object detectors
such as Faster R-CNN, which in turn is slower than the YOLO models. In
this study, we propose an ensemble approach to real-time object
detection that combines the strengths of YOLOv5 and Faster R-CNN. We
first use YOLOv5 to quickly generate a set of object proposals. We then
use Faster R-CNN to refine these proposals and produce more accurate
detections. To further improve accuracy, we propose a cascade
refinement network that uses dynamic fine-tuning: it uses
Kullback-Leibler divergence to dynamically adjust the weights of the
Faster R-CNN model based on the confidence scores of the YOLOv5 object
proposals. We evaluated the proposed approach on a novel dataset
collected in Uganda and compared it with other state-of-the-art
approaches, including RetinaNet, Cascade R-CNN, Single Shot MultiBox
Detector (SSD), and Region-based Convolutional Neural Network (R-CNN).
Experimental results show that the proposed ensemble model outperformed
both base models, with an average precision of 0.96, significantly
higher than the 0.91 of YOLOv5 and the 0.90 of Faster R-CNN. The
ensemble model also achieved real-time inference, processing 25 frames
per second, the same speed as YOLOv5 and faster than the 15 frames per
second of Faster R-CNN. The results also show that the proposed
ensemble model is comparable to other state-of-the-art object detection
models. The proposed approach can be used to improve the accuracy and
speed of real-time object detection in a variety of applications.
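
The abstract does not say how the Kullback-Leibler divergence enters the
fine-tuning step, so the following is only a minimal sketch of the
quantity itself, D_KL(P || Q) = sum_i p_i * log(p_i / q_i), for two
discrete class-probability vectors. The function name `kl_divergence`,
the `eps` guard, and the example vectors are illustrative assumptions,
not taken from the paper.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Return D_KL(P || Q) in nats for two discrete distributions.

    p and q are sequences of probabilities over the same classes.
    eps is an assumed guard that keeps log() finite when q_i is 0.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for p_i, q_i in zip(p, q):
        # Terms with p_i == 0 contribute nothing, by convention.
        if p_i > 0.0:
            total += p_i * math.log(p_i / max(q_i, eps))
    return total


# Hypothetical class scores for one proposal from two detectors.
first_stage_scores = [0.70, 0.20, 0.10]
second_stage_scores = [0.60, 0.30, 0.10]

divergence = kl_divergence(first_stage_scores, second_stage_scores)
print(f"D_KL(first || second) = {divergence:.4f} nats")
```

The divergence is zero when the two distributions agree and grows as
they diverge. It is also asymmetric, so the order of the arguments
matters when it is used as a training signal.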
Related Results
Statistical Divergences between Densities of Truncated Exponential Families with Nested Supports: Duo Bregman and Duo Jensen Divergences
By calculating the Kullback–Leibler divergence between two probability measures belonging to different exponential families dominated by the same measure, we obtain a formula that ...
Electric field tuning characteristic of multiple optical parametric oscillator based on MgO:QPLN
The quasi-phase matching optical parametric oscillator tuning methods, i.e. grating period tuning, temperature tuning, pumping wavelength tuning, and angle tuning are more simple a...
Utilizing Amari-Alpha Divergence to Stabilize the Training of Generative Adversarial Networks
Generative Adversarial Nets (GANs) are one of the most popular architectures for image generation, which has achieved significant progress in generating high-resolution, diverse im...
Sensory Evaluation of Odor Approximation Using NMF with Kullback-Leibler Divergence and Itakura-Saito Divergence in Mass Spectrum Space
The odor reproduction can be achieved by approximating mass spectra of different odors by blending a set of odor components. The method enables us to create various odors by adjust...
Adaptation of the Tuning Parameter in General Bayesian Inference with Robust Divergence
We introduce a novel methodology for robust Bayesian estimation with robust divergence (e.g., density power divergence or γ-divergence), indexed by tuning paramete...
Instruction Tuning on Large Language Models to Improve Reasoning Performance
The growing demand for natural language processing models capable of understanding and executing complex instructions has driven significant advancements in model fine-tuning tech...
Adaptive Multi-source Domain Collaborative Fine-tuning for Transfer Learning
Fine-tuning is an important technique in transfer learning that has achieved significant success in tasks that lack training data. However, as it is difficult to extract effective ...
A Federated Learning-based Optic Disc and Cup Segmentation Model for Glaucoma Monitoring In Color Fundus Photographs
Glaucoma, a leading cause of blindness worldwide, depends on accurate optic nerve head assessment, particul...

