Javascript must be enabled to continue!

EMNet: A Novel Few-Shot Image Classification Model with Enhanced Self-Correlation Attention and Multi-Branch Joint Module

In this research, inspired by the principles of biological visual attention mechanisms and swarm intelligence found in nature, we present an Enhanced Self-Correlation Attention and Multi-Branch Joint Module Network (EMNet), a novel model for few-shot image classification. Few-shot image classification aims to address the problem of image classification when data are limited. Traditional models require a large amount of labeled data for training, while few-shot learning trains models using only a small number of samples (just a few samples per class) to recognize new categories. EMNet shows its potential for bio-inspired algorithms in optimizing feature extraction and enhancing generalization capabilities. It features two key modules: Enhanced Self-Correlated Attention (ESCA) and Multi-Branch Joint Module (MBJ Module). EMNet tackles two main challenges in few-shot learning: how to make an effective important feature extraction and enhancement in images, and improving generalization to new categories. The ESCA module boosts the precision in extracting crucial local features, enhancing classification accuracy. The MBJ module focuses on shared features across images, emphasizing similarities within classes and subtle differences between them. This enhances model adaptability and generalization to new categories. Experimental results show that our model performs better than existing models in one-shot and five-shot tasks on mini-ImageNet, CUB-200, and CIFAR-FS datasets, which proves the proposed model to be an efficient end-to-end solution for few-shot image classification. In the five-way one-shot and five-way five-shot experiments on the CUB-200-2011 dataset, EMNet achieved classification accuracies that were 1.27 and 0.54 percentage points higher than those of RENet, respectively. In the five-way one-shot and five-way five-shot experiments on the miniImageNet dataset, EMNet’s classification accuracies were 0.02 and 0.48 percentage points higher than those of RENet, respectively. In the five-way one-shot and five-way five-shot experiments on the CIFAR-FS dataset, EMNet’s classification accuracies were 0.19 and 0.18 percentage points higher than those of RENet.

MDPI AG

Fufang Li Weixiang Zhang Yi Shang

Biomimetics

2025

Title: EMNet: A Novel Few-Shot Image Classification Model with Enhanced Self-Correlation Attention and Multi-Branch Joint Module

Description:

Few-shot image classification aims to address the problem of image classification when data are limited.

Traditional models require a large amount of labeled data for training, while few-shot learning trains models using only a small number of samples (just a few samples per class) to recognize new categories.

EMNet shows its potential for bio-inspired algorithms in optimizing feature extraction and enhancing generalization capabilities.

It features two key modules: Enhanced Self-Correlated Attention (ESCA) and Multi-Branch Joint Module (MBJ Module).

EMNet tackles two main challenges in few-shot learning: how to make an effective important feature extraction and enhancement in images, and improving generalization to new categories.

The ESCA module boosts the precision in extracting crucial local features, enhancing classification accuracy.

The MBJ module focuses on shared features across images, emphasizing similarities within classes and subtle differences between them.

This enhances model adaptability and generalization to new categories.

Experimental results show that our model performs better than existing models in one-shot and five-shot tasks on mini-ImageNet, CUB-200, and CIFAR-FS datasets, which proves the proposed model to be an efficient end-to-end solution for few-shot image classification.

In the five-way one-shot and five-way five-shot experiments on the CUB-200-2011 dataset, EMNet achieved classification accuracies that were 1.

27 and 0.

54 percentage points higher than those of RENet, respectively.

In the five-way one-shot and five-way five-shot experiments on the miniImageNet dataset, EMNet’s classification accuracies were 0.

02 and 0.

48 percentage points higher than those of RENet, respectively.

In the five-way one-shot and five-way five-shot experiments on the CIFAR-FS dataset, EMNet’s classification accuracies were 0.

19 and 0.

18 percentage points higher than those of RENet.

Back

Data becomes something of a mirror in which people see themselves reflected. (Sorapure 270)In a 2014 essay for The New Yorker, the humourist David Sedaris recounts an obsession spu...

Construction of Enhanced Recovery Training Module for Former Drug Addicts

Construction of an academic module requires few main objectives in the module construction which are Module Construction, Module Validity Assessment, Module Reliability Test, and M...

THE ‘PARENT’ IN THE PARENTING STYLE: A CORRELATIONAL STUDY EXPLORING THE IMPACT OF PARENTING ON SELF-CONCEPT OF THE ADOLESCENT (Preprint)

BACKGROUND The present research attempts to explore the dynamics of parent child relationship. The investigation aims at understanding the impact of parenti...

Study on hardness and wear resistance of shot peened AA7075-T6 aluminum alloy

Abstract AA7075-T6 aluminum alloy samples were shot peened at various shot peening pressures in the range of 10–70 psi to study their mechanical and tribological ...

Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review

Abstract Thoracic outlet syndrome (TOS) is a complex and often overlooked condition caused by the compression of neurovascular structures as they pass through the thoracic outlet. ...

Comparative Evaluation of Zero-Shot and Few-Shot Performance of Large Language Models in Low-Resource Language Machine Translation

Large language models (LLMs) have demonstrated remarkable translation capabilities for high-resource languages, yet their effectiveness on low-resource languages under varying prom...

Sifat Rantai Naik pada Modul r-Noetherian serta Keterkaitan Modul r-Noetherian dengan Modul Noetherian dan Modul Hampir Noetherian

Modules are algebraic structures formed from Abelian groups and rings as scalars. A module is a Noetherian module if it satisfies the ascending chain condition on its submodules. A...

Identify Cricket Shots using Machine Learning

Cricket shot detection is a game-changing technology that offers deep insights into player performance and match data, completely changing the way the sport is played. The main ele...

Email:
Password:

Email:

EMNet: A Novel Few-Shot Image Classification Model with Enhanced Self-Correlation Attention and Multi-Branch Joint Module

Related Results