Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Applying Deep Learning Algorithms for Speech Recognition in Speech-Impaired Children

View through CrossRef
Abstract - Speech impairment affects millions of children worldwide, creating significant barriers to communication, education, and social development. This paper investigates the application of deep learning algorithms for automatic speech recognition (ASR) specifically adapted to the speech patterns of children with speech-language disorders. We evaluate and compare four deep learning architectures Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), Transformer-based models, and hybrid CNN-LSTM frameworks trained and validated on augmented speech corpora drawn from children with articulation disorders, dysarthria, and stuttering. Mel-frequency cepstral coefficients (MFCCs) and spectrogram features serve as primary input representations. Experimental results demonstrate that the hybrid CNN-LSTM model achieves the highest word error rate (WER) reduction, reaching 78.4% recognition accuracy on the test set, outperforming conventional Hidden Markov Model (HMM) baselines by over 31 percentage points. Transfer learning from adult speech corpora, combined with child-specific data augmentation, further improves robustness to irregular phoneme production. The findings confirm that deep learning-based ASR holds substantial promise as an assistive technology for speech-impaired children, with practical implications for therapeutic tools and inclusive educational platforms. Key Words: speech recognition, deep learning, speech impairment, children, LSTM, CNN, transformer, assistive technology, MFCC, dysarthria.
Title: Applying Deep Learning Algorithms for Speech Recognition in Speech-Impaired Children
Description:
Abstract - Speech impairment affects millions of children worldwide, creating significant barriers to communication, education, and social development.
This paper investigates the application of deep learning algorithms for automatic speech recognition (ASR) specifically adapted to the speech patterns of children with speech-language disorders.
We evaluate and compare four deep learning architectures Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), Transformer-based models, and hybrid CNN-LSTM frameworks trained and validated on augmented speech corpora drawn from children with articulation disorders, dysarthria, and stuttering.
Mel-frequency cepstral coefficients (MFCCs) and spectrogram features serve as primary input representations.
Experimental results demonstrate that the hybrid CNN-LSTM model achieves the highest word error rate (WER) reduction, reaching 78.
4% recognition accuracy on the test set, outperforming conventional Hidden Markov Model (HMM) baselines by over 31 percentage points.
Transfer learning from adult speech corpora, combined with child-specific data augmentation, further improves robustness to irregular phoneme production.
The findings confirm that deep learning-based ASR holds substantial promise as an assistive technology for speech-impaired children, with practical implications for therapeutic tools and inclusive educational platforms.
Key Words: speech recognition, deep learning, speech impairment, children, LSTM, CNN, transformer, assistive technology, MFCC, dysarthria.

Related Results

Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
BACKGROUND Mental health has become one of the most urgent global health issues of the twenty-first century. The World Health Organization (WHO) reports tha...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
“The Earth Is Dying, Bro”
“The Earth Is Dying, Bro”
Climate Change and Children Australian children are uniquely situated in a vast landscape that varies drastically across locations. Spanning multiple climatic zones—from cool tempe...
Lapse kuvandist täiskasvanute ja laste endi pilgu läbi
Lapse kuvandist täiskasvanute ja laste endi pilgu läbi
The article analyses the image of the child as perceived from the perspective of children and adults and determines to what extent the perceptions vary between the children and adu...
Reflections Of Zoltan P. Dienes On Mathematics Education
Reflections Of Zoltan P. Dienes On Mathematics Education
The name of Zoltan P. Dienes (1916- ) stands with those ofJean Piaget, Jerome Bruner, Edward Begle, and Robert Davis as legendary figures whose work left a lasting impression on th...
Family Pediatrics
Family Pediatrics
ABSTRACT/EXECUTIVE SUMMARYWhy a Task Force on the Family?The practice of pediatrics is unique among medical specialties in many ways, among which is the nearly certain presence of ...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find ...

Back to Top