Javascript must be enabled to continue!
Applying Deep Learning Algorithms for Speech Recognition in Speech-Impaired Children
View through CrossRef
Abstract - Speech impairment affects millions of children worldwide, creating significant barriers to communication, education, and social development. This paper investigates the application of deep learning algorithms for automatic speech recognition (ASR) specifically adapted to the speech patterns of children with speech-language disorders. We evaluate and compare four deep learning architectures Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), Transformer-based models, and hybrid CNN-LSTM frameworks trained and validated on augmented speech corpora drawn from children with articulation disorders, dysarthria, and stuttering. Mel-frequency cepstral coefficients (MFCCs) and spectrogram features serve as primary input representations. Experimental results demonstrate that the hybrid CNN-LSTM model achieves the highest word error rate (WER) reduction, reaching 78.4% recognition accuracy on the test set, outperforming conventional Hidden Markov Model (HMM) baselines by over 31 percentage points. Transfer learning from adult speech corpora, combined with child-specific data augmentation, further improves robustness to irregular phoneme production. The findings confirm that deep learning-based ASR holds substantial promise as an assistive technology for speech-impaired children, with practical implications for therapeutic tools and inclusive educational platforms.
Key Words: speech recognition, deep learning, speech impairment, children, LSTM, CNN, transformer, assistive technology, MFCC, dysarthria.
Edtech Publishers (OPC) Private Limited
Title: Applying Deep Learning Algorithms for Speech Recognition in Speech-Impaired Children
Description:
Abstract - Speech impairment affects millions of children worldwide, creating significant barriers to communication, education, and social development.
This paper investigates the application of deep learning algorithms for automatic speech recognition (ASR) specifically adapted to the speech patterns of children with speech-language disorders.
We evaluate and compare four deep learning architectures Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), Transformer-based models, and hybrid CNN-LSTM frameworks trained and validated on augmented speech corpora drawn from children with articulation disorders, dysarthria, and stuttering.
Mel-frequency cepstral coefficients (MFCCs) and spectrogram features serve as primary input representations.
Experimental results demonstrate that the hybrid CNN-LSTM model achieves the highest word error rate (WER) reduction, reaching 78.
4% recognition accuracy on the test set, outperforming conventional Hidden Markov Model (HMM) baselines by over 31 percentage points.
Transfer learning from adult speech corpora, combined with child-specific data augmentation, further improves robustness to irregular phoneme production.
The findings confirm that deep learning-based ASR holds substantial promise as an assistive technology for speech-impaired children, with practical implications for therapeutic tools and inclusive educational platforms.
Key Words: speech recognition, deep learning, speech impairment, children, LSTM, CNN, transformer, assistive technology, MFCC, dysarthria.
Related Results
Lists, Spatial Practice and Assistive Technologies for the Blind
Lists, Spatial Practice and Assistive Technologies for the Blind
IntroductionSupermarkets are functionally challenging environments for people with vision impairments. A supermarket is likely to house an average of 45,000 products in a median fl...
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
BACKGROUND
Mental health has become one of the most urgent global health issues of the twenty-first century. The World Health Organization (WHO) reports tha...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Daniela Fenu Foerch: interview by Márcia Fusaro and Ana Maria Haddad Baptista
Daniela Fenu Foerch: interview by Márcia Fusaro and Ana Maria Haddad Baptista
EccoS Journal: Dr Foerch thank you very much for this interview. Could you start telling us about your professional background and what the WeFEEL project is?
Daniela Fenu Foerch:...
“The Earth Is Dying, Bro”
“The Earth Is Dying, Bro”
Climate Change and Children
Australian children are uniquely situated in a vast landscape that varies drastically across locations. Spanning multiple climatic zones—from cool tempe...
Parental compliance for speech therapy of their hearing-impaired children using cochlear implants
Parental compliance for speech therapy of their hearing-impaired children using cochlear implants
Background: Cochlear implant is a miraculous surgery to improve hearing in profound hearing-impaired children who derive no benefit from hearing aids and consequently present with ...
Upaya Guru dalam Meningkatkan Minat Membaca Anak pada Masa Adaptasi Kebiasaan Baru di BMBA AIUEO Batujajar Bandung
Upaya Guru dalam Meningkatkan Minat Membaca Anak pada Masa Adaptasi Kebiasaan Baru di BMBA AIUEO Batujajar Bandung
Abstract. Based on the PISA report which was just released 2019, Indonesia's reading score is ranked 72 out of 77 countries (liputan6.com,2019). This condition shows the poor inter...

