Javascript must be enabled to continue!
Open-Vocabulary Fine-Grained Hand Action Detection
View through CrossRef
In this work, we address the new challenge of open-vocabulary fine-grained hand action detection, which aims to recognize hand actions from both known and novel categories using textual descriptions. Traditional hand action detection methods are limited to closed-set detection, making it difficult for them to generalize to new, unseen hand action categories. While current open-vocabulary detection (OVD) methods are effective at detecting novel objects, they face challenges with fine-grained action recognition, particularly when data is limited and heterogeneous. This often leads to poor generalization and performance bias between base and novel categories. To address these issues, we propose a novel approach, Open-FGHA (Open-vocabulary Fine-Grained Hand Action), which learns to distinguish fine-grained features across multiple modalities from limited heterogeneous data. It then identifies optimal matching relationships among these features, enabling accurate open-vocabulary fine-grained hand action detection. Specifically, we introduce three key components: Hierarchical Heterogeneous Low-Rank Adaptation, Bidirectional Selection and Fusion Mechanism, and Cross-Modality Query Generator. These components work in unison to enhance the alignment and fusion of multimodal fine-grained features. Extensive experiments demonstrate that Open-FGHA outperforms existing OVD methods, showing its strong potential for open-vocabulary hand action detection. The source code is available at OV-FGHAD.
International Joint Conferences on Artificial Intelligence Organization
Title: Open-Vocabulary Fine-Grained Hand Action Detection
Description:
In this work, we address the new challenge of open-vocabulary fine-grained hand action detection, which aims to recognize hand actions from both known and novel categories using textual descriptions.
Traditional hand action detection methods are limited to closed-set detection, making it difficult for them to generalize to new, unseen hand action categories.
While current open-vocabulary detection (OVD) methods are effective at detecting novel objects, they face challenges with fine-grained action recognition, particularly when data is limited and heterogeneous.
This often leads to poor generalization and performance bias between base and novel categories.
To address these issues, we propose a novel approach, Open-FGHA (Open-vocabulary Fine-Grained Hand Action), which learns to distinguish fine-grained features across multiple modalities from limited heterogeneous data.
It then identifies optimal matching relationships among these features, enabling accurate open-vocabulary fine-grained hand action detection.
Specifically, we introduce three key components: Hierarchical Heterogeneous Low-Rank Adaptation, Bidirectional Selection and Fusion Mechanism, and Cross-Modality Query Generator.
These components work in unison to enhance the alignment and fusion of multimodal fine-grained features.
Extensive experiments demonstrate that Open-FGHA outperforms existing OVD methods, showing its strong potential for open-vocabulary hand action detection.
The source code is available at OV-FGHAD.
Related Results
Open-Vocabulary Fine-Grained Hand Action Detection
Open-Vocabulary Fine-Grained Hand Action Detection
In this work, we address the new challenge of open-vocabulary fine-grained hand action detection, which aims to recognize hand actions from both known and novel categories using te...
HSK 以外的汉语新词汇分类
HSK 以外的汉语新词汇分类
<p class="AA">As the number of new Chinese vocabulary increases year by year, mastering new vocabulary beyond the HSK (Global Chinese Proficiency Test) syllabus is crucial to...
Trajectory of Learning Academic Vocabulary: IT Undergraduates’ Vocabulary Learning Strategies and Performance at the Exam
Trajectory of Learning Academic Vocabulary: IT Undergraduates’ Vocabulary Learning Strategies and Performance at the Exam
Learning vocabulary is an integral part in language acquisition and acquisition of academic vocabulary is crucial for the success in an academic context. Therefore, many studies ha...
The neural basis of intelligence in fine-grained cortical topographies
The neural basis of intelligence in fine-grained cortical topographies
AbstractIntelligent thought is the product of efficient neural information processing, which is embedded in fine-grained, topographically-organized population responses and support...
THE EFFECTIVENESS OF VOCABULARY GAMES ON VOCABULARY ACQUISITION: A LITERATURE REVIEW
THE EFFECTIVENESS OF VOCABULARY GAMES ON VOCABULARY ACQUISITION: A LITERATURE REVIEW
Acquiring language skills, namely listening, reading, speaking and writing, is fundamentally dependent on the mastery of vocabulary. Thus, teachers' application of attractive, ente...
Imbalanced image classification algorithm based on fine-grained analysis
Imbalanced image classification algorithm based on fine-grained analysis
Fine-grained attribute analysis and data imbalance have always been research hotspots in the field of computer vision. Due to the complexity and diversity of fine-grained attribute...
FGEFNet: Fine-Grained Extraction and Flow Network for Crowd Counting
FGEFNet: Fine-Grained Extraction and Flow Network for Crowd Counting
Abstract
Crowd counting is an important application of artificial intelligence in computer graphics and one of the most challenging research areas in the field of computer ...
Alternative potassium source for the cultivation of ornamental sunflower
Alternative potassium source for the cultivation of ornamental sunflower
ABSTRACT Brazil is dependent on importation of fertilizers, especially the potassics. Rocks and minerals that contain nutrients have a potential for use in agriculture as fertilize...

