Sign Language Recognition with Multimodal Sensors and Deep Learning Methods
Sign language recognition is essential to communication for hearing-impaired people. It is an important problem in computer vision and has advanced rapidly alongside image recognition technology. However, recognition with a general monocular camera suffers from occlusion and limited accuracy. In this research, we aim to improve accuracy by using a 2-axis bending sensor as an aid to image recognition: hand keypoint information is acquired from sign language actions captured by a monocular RGB camera and supplemented with sensor data. Improving recognition requires new AI models. In addition, the dataset is small because it is the original dataset of our laboratory. To learn from both sensor data and image data, we used MediaPipe, CNN, and BiLSTM. MediaPipe, provided by Google, estimates the skeleton of the hand and face. CNN can learn spatial information, and BiLSTM can learn time-series data; combining the two yields higher recognition accuracy. We use these techniques to learn hand skeletal information and sensor data, with the data from the 2-axis bending sensor glove supporting the training of the AI model. Our method, which combines sensor data and hand skeleton data, performed better than using skeletal information alone, achieving 96.5% Top-1 accuracy.
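As a rough illustration of what combining hand skeleton data with sensor data can look like before a sequence model sees it, the sketch below concatenates per-frame hand keypoints with per-frame glove readings and computes Top-1 accuracy from class scores. It is not the authors' implementation. The abstract does not say how the two modalities are fused, so frame-level concatenation is an assumption, as are the channel count `SENSOR_CHANNELS` and all function names. The 21 landmarks with x, y, z coordinates follow MediaPipe's hand landmark output.

```python
from typing import List, Sequence

NUM_LANDMARKS = 21        # MediaPipe hand tracking reports 21 landmarks per hand
COORDS_PER_LANDMARK = 3   # x, y, z for each landmark
SENSOR_CHANNELS = 10      # ASSUMPTION: 5 fingers x 2 axes; not stated in the abstract


def fuse_frame(keypoints: Sequence[float], sensor: Sequence[float]) -> List[float]:
    """Concatenate one frame's flattened keypoints with its sensor readings."""
    if len(keypoints) != NUM_LANDMARKS * COORDS_PER_LANDMARK:
        raise ValueError("expected 63 keypoint values per frame")
    if len(sensor) != SENSOR_CHANNELS:
        raise ValueError("unexpected number of sensor channels")
    return list(keypoints) + list(sensor)


def fuse_sequence(
    keypoint_frames: Sequence[Sequence[float]],
    sensor_frames: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Fuse two time-aligned streams into one feature sequence (hypothetical helper)."""
    if len(keypoint_frames) != len(sensor_frames):
        raise ValueError("streams must be aligned to the same number of frames")
    return [fuse_frame(k, s) for k, s in zip(keypoint_frames, sensor_frames)]


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Fraction of samples whose highest-scoring class equals the true label."""
    if not labels or len(scores) != len(labels):
        raise ValueError("scores and labels must be non-empty and the same length")
    correct = 0
    for row, label in zip(scores, labels):
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return correct / len(labels)


# Example: a 4-frame clip with placeholder values (not real data).
clip = fuse_sequence([[0.0] * 63 for _ in range(4)], [[1.0] * 10 for _ in range(4)])
```

Each fused frame here has 63 + 10 = 73 values, and the resulting sequence is the kind of input a CNN followed by a BiLSTM could consume. A real pipeline would also need to resample the camera and glove streams to a common frame rate, which this sketch takes for granted.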

