Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Sketch Recognition Using Mamba Model for Computer Vision Tasks

View through CrossRef
Abstract Sketch recognition involves classifying and retrieving hand-drawn sketches. Traditional deep learning models like CNNs, RNNs, and transformers often struggle due to unbalanced datasets and poor generalization across sketch styles and categories. These limitations hinder the development of effective systems. To address these challenges, we propose Mamba, a novel deep learning framework for sketch classification and retrieval. Mamba integrates CNNs for feature extraction, RNNs for capturing temporal dependencies, and a dedicated Mamba module that enhances visual attention mechanisms and feature activation mapping. Our dynamic refinement of sketch representations improves generalization and adaptability. We then trained on large-scale datasets such as QuickDraw, TU-Berlin, and SketchyScene. Mamba outperforms existing methods in terms of recognition accuracy, robustness, and interpretability. Our evaluations demonstrate that Mamba not only improves classification precision but also provides valuable insights into feature attribution. This framework is a promising approach for real-world sketch recognition applications, highlighting the importance of structured feature representation learning and attention mechanisms to enhance reliability, ease of use, and performance in sketch-based computer vision.
Title: Sketch Recognition Using Mamba Model for Computer Vision Tasks
Description:
Abstract Sketch recognition involves classifying and retrieving hand-drawn sketches.
Traditional deep learning models like CNNs, RNNs, and transformers often struggle due to unbalanced datasets and poor generalization across sketch styles and categories.
These limitations hinder the development of effective systems.
To address these challenges, we propose Mamba, a novel deep learning framework for sketch classification and retrieval.
Mamba integrates CNNs for feature extraction, RNNs for capturing temporal dependencies, and a dedicated Mamba module that enhances visual attention mechanisms and feature activation mapping.
Our dynamic refinement of sketch representations improves generalization and adaptability.
We then trained on large-scale datasets such as QuickDraw, TU-Berlin, and SketchyScene.
Mamba outperforms existing methods in terms of recognition accuracy, robustness, and interpretability.
Our evaluations demonstrate that Mamba not only improves classification precision but also provides valuable insights into feature attribution.
This framework is a promising approach for real-world sketch recognition applications, highlighting the importance of structured feature representation learning and attention mechanisms to enhance reliability, ease of use, and performance in sketch-based computer vision.

Related Results

Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Identifying Links Between Latent Memory and Speech Recognition Factors
Identifying Links Between Latent Memory and Speech Recognition Factors
Objectives: The link between memory ability and speech recognition accuracy is often examined by correlating summary measures of performance across various tasks, but i...
Book of paintings made in Portugal and Spain
Book of paintings made in Portugal and Spain
IE TCD MS 6208 is one of 5 sketchbooks kept by John Synge during his early 19th-century European travels. Where the works are finished they are sepia wash depictions of buildings i...
Vision-specific and psychosocial impacts of low vision among patients with low vision at the eastern regional Low Vision Centre
Vision-specific and psychosocial impacts of low vision among patients with low vision at the eastern regional Low Vision Centre
Purpose: To determine vision-specific and psychosocial implications of low vision among patients with low vision visiting the Low Vision Centre of the Eastern Regional Hospital in ...
SIMULATION OF COMPUTER VISION SYSTEMS WITH ARTIFICIAL INTELLIGENCE
SIMULATION OF COMPUTER VISION SYSTEMS WITH ARTIFICIAL INTELLIGENCE
. In recent years, the rapid development of artificial intelligence technologies has led to significant advances in the field of computer vision, dedicated to extracting valuable i...
Mechanisms Responsible for the Anticoagulant Properties of Neurotoxic Dendroaspis Venoms: A Viscoelastic Analysis
Mechanisms Responsible for the Anticoagulant Properties of Neurotoxic Dendroaspis Venoms: A Viscoelastic Analysis
Using thrombelastography to gain mechanistic insights, recent investigations have identified enzymes and compounds in Naja and Crotalus species’ neurotoxic venoms that are anticoag...
Learning manufacturing computer vision systems using tiny YOLOv4
Learning manufacturing computer vision systems using tiny YOLOv4
Implementing and deploying advanced technologies are principal in improving manufacturing processes, signifying a transformative stride in the industrial sector. Computer vision pl...
Decoding task representations that support generalization in hierarchical task
Decoding task representations that support generalization in hierarchical task
AbstractTask knowledge can be encoded hierarchically such that complex tasks can be built by associating simpler tasks. This associative organization supports generalization to fac...

Back to Top