Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

EMNet: A Novel Few-Shot Image Classification Model with Enhanced Self-Correlation Attention and Multi-Branch Joint Module

View through CrossRef
In this research, inspired by the principles of biological visual attention mechanisms and swarm intelligence found in nature, we present an Enhanced Self-Correlation Attention and Multi-Branch Joint Module Network (EMNet), a novel model for few-shot image classification. Few-shot image classification aims to address the problem of image classification when data are limited. Traditional models require a large amount of labeled data for training, while few-shot learning trains models using only a small number of samples (just a few samples per class) to recognize new categories. EMNet shows its potential for bio-inspired algorithms in optimizing feature extraction and enhancing generalization capabilities. It features two key modules: Enhanced Self-Correlated Attention (ESCA) and Multi-Branch Joint Module (MBJ Module). EMNet tackles two main challenges in few-shot learning: how to make an effective important feature extraction and enhancement in images, and improving generalization to new categories. The ESCA module boosts the precision in extracting crucial local features, enhancing classification accuracy. The MBJ module focuses on shared features across images, emphasizing similarities within classes and subtle differences between them. This enhances model adaptability and generalization to new categories. Experimental results show that our model performs better than existing models in one-shot and five-shot tasks on mini-ImageNet, CUB-200, and CIFAR-FS datasets, which proves the proposed model to be an efficient end-to-end solution for few-shot image classification. In the five-way one-shot and five-way five-shot experiments on the CUB-200-2011 dataset, EMNet achieved classification accuracies that were 1.27 and 0.54 percentage points higher than those of RENet, respectively. In the five-way one-shot and five-way five-shot experiments on the miniImageNet dataset, EMNet’s classification accuracies were 0.02 and 0.48 percentage points higher than those of RENet, respectively. In the five-way one-shot and five-way five-shot experiments on the CIFAR-FS dataset, EMNet’s classification accuracies were 0.19 and 0.18 percentage points higher than those of RENet.
Title: EMNet: A Novel Few-Shot Image Classification Model with Enhanced Self-Correlation Attention and Multi-Branch Joint Module
Description:
In this research, inspired by the principles of biological visual attention mechanisms and swarm intelligence found in nature, we present an Enhanced Self-Correlation Attention and Multi-Branch Joint Module Network (EMNet), a novel model for few-shot image classification.
Few-shot image classification aims to address the problem of image classification when data are limited.
Traditional models require a large amount of labeled data for training, while few-shot learning trains models using only a small number of samples (just a few samples per class) to recognize new categories.
EMNet shows its potential for bio-inspired algorithms in optimizing feature extraction and enhancing generalization capabilities.
It features two key modules: Enhanced Self-Correlated Attention (ESCA) and Multi-Branch Joint Module (MBJ Module).
EMNet tackles two main challenges in few-shot learning: how to make an effective important feature extraction and enhancement in images, and improving generalization to new categories.
The ESCA module boosts the precision in extracting crucial local features, enhancing classification accuracy.
The MBJ module focuses on shared features across images, emphasizing similarities within classes and subtle differences between them.
This enhances model adaptability and generalization to new categories.
Experimental results show that our model performs better than existing models in one-shot and five-shot tasks on mini-ImageNet, CUB-200, and CIFAR-FS datasets, which proves the proposed model to be an efficient end-to-end solution for few-shot image classification.
In the five-way one-shot and five-way five-shot experiments on the CUB-200-2011 dataset, EMNet achieved classification accuracies that were 1.
27 and 0.
54 percentage points higher than those of RENet, respectively.
In the five-way one-shot and five-way five-shot experiments on the miniImageNet dataset, EMNet’s classification accuracies were 0.
02 and 0.
48 percentage points higher than those of RENet, respectively.
In the five-way one-shot and five-way five-shot experiments on the CIFAR-FS dataset, EMNet’s classification accuracies were 0.
19 and 0.
18 percentage points higher than those of RENet.

Related Results

Is a Fitbit a Diary? Self-Tracking and Autobiography
Is a Fitbit a Diary? Self-Tracking and Autobiography
Data becomes something of a mirror in which people see themselves reflected. (Sorapure 270)In a 2014 essay for The New Yorker, the humourist David Sedaris recounts an obsession spu...
Construction of Enhanced Recovery Training Module for Former Drug Addicts
Construction of Enhanced Recovery Training Module for Former Drug Addicts
Construction of an academic module requires few main objectives in the module construction which are Module Construction, Module Validity Assessment, Module Reliability Test, and M...
Study on hardness and wear resistance of shot peened AA7075-T6 aluminum alloy
Study on hardness and wear resistance of shot peened AA7075-T6 aluminum alloy
Abstract AA7075-T6 aluminum alloy samples were shot peened at various shot peening pressures in the range of 10–70 psi to study their mechanical and tribological ...
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Abstract Thoracic outlet syndrome (TOS) is a complex and often overlooked condition caused by the compression of neurovascular structures as they pass through the thoracic outlet. ...
Comparative Evaluation of Zero-Shot and Few-Shot Performance of Large Language Models in Low-Resource Language Machine Translation
Comparative Evaluation of Zero-Shot and Few-Shot Performance of Large Language Models in Low-Resource Language Machine Translation
Large language models (LLMs) have demonstrated remarkable translation capabilities for high-resource languages, yet their effectiveness on low-resource languages under varying prom...
Sifat Rantai Naik pada Modul r-Noetherian serta Keterkaitan Modul r-Noetherian dengan Modul Noetherian dan Modul Hampir Noetherian
Sifat Rantai Naik pada Modul r-Noetherian serta Keterkaitan Modul r-Noetherian dengan Modul Noetherian dan Modul Hampir Noetherian
Modules are algebraic structures formed from Abelian groups and rings as scalars. A module is a Noetherian module if it satisfies the ascending chain condition on its submodules. A...
Identify Cricket Shots using Machine Learning
Identify Cricket Shots using Machine Learning
Cricket shot detection is a game-changing technology that offers deep insights into player performance and match data, completely changing the way the sport is played. The main ele...

Back to Top