Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Spectral-Warping Based Noise-Robust Enhanced Children ASR System

View through CrossRef
Abstract In real-life applications, noise originating from different sound sources modifies the characteristics of an input signal which affects the development of an enhanced ASR system. This contamination degrades the quality and comprehension of speech variables while impacting the performance of human-machine communication systems. This paper aims to minimise noise challenges by using a robust feature extraction methodology through introduction of an optimised filtering technique. Initially, the evaluations for enhancing input signals are constructed by using state transformation matrix and minimising a mean square error based upon the linear time variance techniques of Kalman and Adaptive Wiener Filtering. Consequently, Mel-frequency cepstral coefficients (MFCC), Linear Predictive Cepstral Coefficient (LPCC), RelAtive SpecTrAl-Perceptual Linear Prediction (RASTA-PLP) and Gammatone Frequency cepstral coefficient (GFCC) based feature extraction methods have been synthesised with their comparable efficiency in order to derive the adequate characteristics of a signal. It also handle the large-scale training complexities lies among the training and testing dataset. Consequently, the acoustic mismatch and linguistic complexity of large-scale variations lies within small set of speakers have been handle by utilising the Vocal Tract Length Normalization (VTLN) based warping of the test utterances. Furthermore, the spectral warping approach has been used by time reversing the samples inside a frame and passing them into the filter network corresponding to each frame. Finally, the overall Relative Improvement (RI) of 16.13% on 5-way perturbed spectral warped based noise augmented dataset through Wiener Filtering in comparison to other systems respectively.
Title: Spectral-Warping Based Noise-Robust Enhanced Children ASR System
Description:
Abstract In real-life applications, noise originating from different sound sources modifies the characteristics of an input signal which affects the development of an enhanced ASR system.
This contamination degrades the quality and comprehension of speech variables while impacting the performance of human-machine communication systems.
This paper aims to minimise noise challenges by using a robust feature extraction methodology through introduction of an optimised filtering technique.
Initially, the evaluations for enhancing input signals are constructed by using state transformation matrix and minimising a mean square error based upon the linear time variance techniques of Kalman and Adaptive Wiener Filtering.
Consequently, Mel-frequency cepstral coefficients (MFCC), Linear Predictive Cepstral Coefficient (LPCC), RelAtive SpecTrAl-Perceptual Linear Prediction (RASTA-PLP) and Gammatone Frequency cepstral coefficient (GFCC) based feature extraction methods have been synthesised with their comparable efficiency in order to derive the adequate characteristics of a signal.
It also handle the large-scale training complexities lies among the training and testing dataset.
Consequently, the acoustic mismatch and linguistic complexity of large-scale variations lies within small set of speakers have been handle by utilising the Vocal Tract Length Normalization (VTLN) based warping of the test utterances.
Furthermore, the spectral warping approach has been used by time reversing the samples inside a frame and passing them into the filter network corresponding to each frame.
Finally, the overall Relative Improvement (RI) of 16.
13% on 5-way perturbed spectral warped based noise augmented dataset through Wiener Filtering in comparison to other systems respectively.

Related Results

Development of Parametric Model and Warping Analysis of Composite Beam with Multiple Rigid Regions
Development of Parametric Model and Warping Analysis of Composite Beam with Multiple Rigid Regions
Composite materials are used extensively in aircraft structures, automobiles, sporting goods, and many consumer products. Thin-walled multicell beams made of composite materials, h...
Mechanism of suppressing noise intensity of squeezed state enhancement
Mechanism of suppressing noise intensity of squeezed state enhancement
This research focuses on advanced noise suppression technologies for high-precision measurement systems, particularly addressing the limitations of classical noise reducing approac...
A Comprehensive Review of Noise Measurement, Standards, Assessment, Geospatial Mapping and Public Health
A Comprehensive Review of Noise Measurement, Standards, Assessment, Geospatial Mapping and Public Health
Noise pollution is an emerging issue in cities around the world. Noise is a pernicious pollutant in urban landscapes mainly due to the increasing number of city inhabitants, road a...
SU‐DD‐A4‐01: Multi‐Day Multi‐Modality Image Co‐Registration
SU‐DD‐A4‐01: Multi‐Day Multi‐Modality Image Co‐Registration
Purpose: To develop a methodology for multi‐day co‐registration of multi‐modality images taken throughout the course of radiotherapy, for the assessment of tumor response to treatm...
Family Pediatrics
Family Pediatrics
ABSTRACT/EXECUTIVE SUMMARYWhy a Task Force on the Family?The practice of pediatrics is unique among medical specialties in many ways, among which is the nearly certain presence of ...
An ensemble technique for speech recognition in noisy environments
An ensemble technique for speech recognition in noisy environments
<span>Automatic speech recognition (ASR) is a technology that allows a computer and mobile device to recognize and translate spoken language into text. ASR systems often prod...
The Application of S‐transform Spectrum Decomposition Technique in Extraction of Weak Seismic Signals
The Application of S‐transform Spectrum Decomposition Technique in Extraction of Weak Seismic Signals
AbstractIn processing of deep seismic reflection data, when the frequency band difference between the weak useful signal and noise both from the deep subsurface is very small and h...

Back to Top