Javascript must be enabled to continue!

Sign Language Recognition with Multimodal Sensors and Deep Learning Methods

Sign language recognition is essential in hearing-impaired people’s communication. Sign language recognition is an important concern in computer vision and has been developed with rapid progress in image recognition technology. However, sign language recognition using a general monocular camera has problems with occlusion and recognition accuracy in sign language recognition. In this research, we aim to improve accuracy by using a 2-axis bending sensor as an aid in addition to image recognition. We aim to achieve higher recognition accuracy by acquiring hand keypoint information of sign language actions captured by a monocular RGB camera and adding sensor assist. To improve sign language recognition, we need to propose new AI models. In addition, the amount of dataset is small because it uses the original data set of our laboratory. To learn using sensor data and image data, we used MediaPipe, CNN, and BiLSTM to perform sign language recognition. MediaPipe is a method for estimating the skeleton of the hand and face provided by Google. In addition, CNN is a method that can learn spatial information, and BiLSTM can learn time series data. Combining the CNN and BiLSTM methods yields higher recognition accuracy. We will use these techniques to learn hand skeletal information and sensor data. Additionally, the 2-axis Bending sensor glove data support training AI model. Using these methods, we aim to improve the recognition accuracy of sign language recognition by combining sensor data and hand skeleton data. Our method performed better than using skeletal information, achieving 96.5% accuracy in Top-1.

MDPI AG

Chenghong Lu Misaki Kozakai Lei Jing

2023

Title: Sign Language Recognition with Multimodal Sensors and Deep Learning Methods

Description:

Sign language recognition is essential in hearing-impaired people’s communication.

Sign language recognition is an important concern in computer vision and has been developed with rapid progress in image recognition technology.

However, sign language recognition using a general monocular camera has problems with occlusion and recognition accuracy in sign language recognition.

In this research, we aim to improve accuracy by using a 2-axis bending sensor as an aid in addition to image recognition.

We aim to achieve higher recognition accuracy by acquiring hand keypoint information of sign language actions captured by a monocular RGB camera and adding sensor assist.

To improve sign language recognition, we need to propose new AI models.

In addition, the amount of dataset is small because it uses the original data set of our laboratory.

To learn using sensor data and image data, we used MediaPipe, CNN, and BiLSTM to perform sign language recognition.

MediaPipe is a method for estimating the skeleton of the hand and face provided by Google.

In addition, CNN is a method that can learn spatial information, and BiLSTM can learn time series data.

Combining the CNN and BiLSTM methods yields higher recognition accuracy.

We will use these techniques to learn hand skeletal information and sensor data.

Additionally, the 2-axis Bending sensor glove data support training AI model.

Using these methods, we aim to improve the recognition accuracy of sign language recognition by combining sensor data and hand skeleton data.

Our method performed better than using skeletal information, achieving 96.

5% accuracy in Top-1.

Back

<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...

Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)

BACKGROUND Mental health has become one of the most urgent global health issues of the twenty-first century. The World Health Organization (WHO) reports tha...

Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga

The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...

Indian Sign Language Recognition System using GAN and Ensemble based approach

Abstract Sign language is the most prominent mode of communication for people who have hearing problems. Continuous sign language recognition is a poorly supervised job tha...

Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program

Abstract Funding Acknowledgements Type of funding sources: None. INTRODUCTION Patients with heart failure (HF)...

Learning to extract features for 2D – 3D multimodal registration

The ability to capture depth information form an scene has greatly increased in the recent years. 3D sensors, traditionally high cost and low resolution sensors, are being democrat...

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021

The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Development of a multimodal imaging system based on LIDAR

(English) Perception of the environment is an essential requirement for the fields of autonomous vehicles and robotics, that claim for high amounts of data to make reliable decisio...

Email:
Password:

Email:

Sign Language Recognition with Multimodal Sensors and Deep Learning Methods

Related Results