Spectral-Warping Based Noise-Robust Enhanced Children ASR System

Abstract: In real-life applications, noise from different sound sources alters the characteristics of the input signal, which hinders the development of an enhanced ASR system. This contamination degrades the quality and intelligibility of speech and impairs the performance of human-machine communication systems. This paper aims to reduce the impact of noise by combining a robust feature extraction methodology with an optimised filtering technique. First, the input signals are enhanced by using a state transformation matrix and minimising the mean square error, based on the linear time variance techniques of Kalman and Adaptive Wiener Filtering. Next, Mel-Frequency Cepstral Coefficient (MFCC), Linear Predictive Cepstral Coefficient (LPCC), RelAtive SpecTrAl-Perceptual Linear Prediction (RASTA-PLP) and Gammatone Frequency Cepstral Coefficient (GFCC) based feature extraction methods are combined and their efficiency compared in order to derive adequate characteristics of the signal. The approach also handles the large-scale training complexities that lie between the training and testing datasets. The acoustic mismatch and linguistic complexity arising from large-scale variations within a small set of speakers are handled by Vocal Tract Length Normalization (VTLN) based warping of the test utterances. Furthermore, spectral warping is applied by time-reversing the samples inside a frame and passing them through the filter network corresponding to each frame. Finally, an overall Relative Improvement (RI) of 16.13% is obtained on the 5-way perturbed, spectral-warped, noise-augmented dataset with Wiener Filtering in comparison to the other systems.
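VTLN-based warping of the kind mentioned in the abstract is often implemented as a piecewise-linear rescaling of the frequency axis. The sketch below illustrates that common formulation only; it is not taken from the paper and does not reproduce the frame-wise time-reversal scheme the abstract describes. The function name `vtln_warp`, the 8 kHz Nyquist frequency, the 0.8 break point, and the warp factors are all illustrative assumptions.

```python
def vtln_warp(freq_hz, alpha, nyquist_hz=8000.0, break_frac=0.8):
    """Piecewise-linear VTLN warp of a single frequency (hypothetical helper).

    Below the break frequency the axis is scaled by `alpha`; above it a second
    linear segment brings the warped axis back to the Nyquist frequency, so
    that 0 maps to 0 and Nyquist maps to Nyquist.
    """
    if not 0.0 <= freq_hz <= nyquist_hz:
        raise ValueError("freq_hz must lie in [0, nyquist_hz]")
    break_hz = break_frac * nyquist_hz
    if alpha <= 0.0 or alpha * break_hz >= nyquist_hz:
        raise ValueError("alpha is out of range for this break point")
    if freq_hz <= break_hz:
        # Lower band: plain scaling by the warp factor.
        return alpha * freq_hz
    # Upper band: straight line from (break_hz, alpha * break_hz)
    # to (nyquist_hz, nyquist_hz).
    slope = (nyquist_hz - alpha * break_hz) / (nyquist_hz - break_hz)
    return alpha * break_hz + slope * (freq_hz - break_hz)


if __name__ == "__main__":
    # Assumed warp factors, chosen only for illustration.
    for alpha in (0.9, 1.0, 1.1):
        warped = [round(vtln_warp(f, alpha), 1) for f in (0.0, 2000.0, 7000.0, 8000.0)]
        print(alpha, warped)
```

A warp factor of 1.0 leaves the axis unchanged. Other factors scale the lower band, and the upper segment adjusts its slope so that the warped axis still ends at the Nyquist frequency.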

Related Results

Development of Parametric Model and Warping Analysis of Composite Beam with Multiple Rigid Regions
Composite materials are used extensively in aircraft structures, automobiles, sporting goods, and many consumer products. Thin-walled multicell beams made of composite materials, h...
Mechanism of suppressing noise intensity of squeezed state enhancement
This research focuses on advanced noise suppression technologies for high-precision measurement systems, particularly addressing the limitations of classical noise reducing approac...
A Comprehensive Review of Noise Measurement, Standards, Assessment, Geospatial Mapping and Public Health
Noise pollution is an emerging issue in cities around the world. Noise is a pernicious pollutant in urban landscapes mainly due to the increasing number of city inhabitants, road a...
Lapse kuvandist täiskasvanute ja laste endi pilgu läbi
The article analyses the image of the child as perceived from the perspective of children and adults and determines to what extent the perceptions vary between the children and adu...
SU‐DD‐A4‐01: Multi‐Day Multi‐Modality Image Co‐Registration
Purpose: To develop a methodology for multi‐day co‐registration of multi‐modality images taken throughout the course of radiotherapy, for the assessment of tumor response to treatm...
An ensemble technique for speech recognition in noisy environments
Automatic speech recognition (ASR) is a technology that allows a computer and mobile device to recognize and translate spoken language into text. ASR systems often prod...
Family Pediatrics
ABSTRACT/EXECUTIVE SUMMARY. Why a Task Force on the Family? The practice of pediatrics is unique among medical specialties in many ways, among which is the nearly certain presence of ...