Javascript must be enabled to continue!
Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
View through CrossRef
The placement of a microphone at its correct position is crucial for
automatic speech recognition (ASR) in a single-microphone distant speech
recognition setup. Understandably, it is a practical problem that the
performance degrades due to accidental displacement of the microphone
from the correct position through routine use and maintenance. In this
paper, an approach is formulated which makes it possible to determine
the optimal position of the microphone, given the geometry of the room,
the location of the speaker, and the inexact location of the microphone.
The metric used to measure the correctness of the microphone position is
the average character error rate (cer) of the ASR system at that
position. The proposed approach uses limited available data which is
usually the case when dealing with microphone position error analysis
while considering room acoustics. A neural network classifier is trained
using limited data to associate the location of the microphone with the
corresponding class of cer values. The obtained prediction is used to
trace the microphone back to the optimal position.
Institute of Electrical and Electronics Engineers (IEEE)
Title: Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
Description:
The placement of a microphone at its correct position is crucial for
automatic speech recognition (ASR) in a single-microphone distant speech
recognition setup.
Understandably, it is a practical problem that the
performance degrades due to accidental displacement of the microphone
from the correct position through routine use and maintenance.
In this
paper, an approach is formulated which makes it possible to determine
the optimal position of the microphone, given the geometry of the room,
the location of the speaker, and the inexact location of the microphone.
The metric used to measure the correctness of the microphone position is
the average character error rate (cer) of the ASR system at that
position.
The proposed approach uses limited available data which is
usually the case when dealing with microphone position error analysis
while considering room acoustics.
A neural network classifier is trained
using limited data to associate the location of the microphone with the
corresponding class of cer values.
The obtained prediction is used to
trace the microphone back to the optimal position.
Related Results
Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
<p>The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup. Understand...
Under-Sampled Microphone Data as a Wind Measurement
Under-Sampled Microphone Data as a Wind Measurement
It is common human experience that wind is associated with noise. Acoustic noise is generated by the turbulent pressure fluctuations associated with the shearing flow near the gro...
Revise and remember : managing hip resurfacing implant recall
Revise and remember : managing hip resurfacing implant recall
<h4>Background</h4><p dir="ltr">Total hip arthroplasty (THA) with metal-on-polyethylene articulation provides excellent clinical outcomes but remains associated w...
Revise and remember : managing hip resurfacing implant recall
Revise and remember : managing hip resurfacing implant recall
<h4>Background</h4><p dir="ltr">Total hip arthroplasty (THA) with metal-on-polyethylene articulation provides excellent clinical outcomes but remains associated w...
Artifact Subspace Reconstruction (ASR) for electroencephalography artifact removal must be optimized for each unique dataset
Artifact Subspace Reconstruction (ASR) for electroencephalography artifact removal must be optimized for each unique dataset
Artifact subspace reconstruction (ASR) is an automatic artifact reject method that can effectively remove transient or large-amplitude artifacts found in electroencephalographic (E...
Characterization of Micromachined Seesaw Type Microphone
Characterization of Micromachined Seesaw Type Microphone
A seesaw type microphone is characterized to study its feasibility for sound source localization. The microphone is composed of a flexible rectangular diaphragm sustained by two to...
Global burden of mental disorders in 204 countries and territories, 1990 - 2021: results from the global burden of disease study 2021
Global burden of mental disorders in 204 countries and territories, 1990 - 2021: results from the global burden of disease study 2021
Abstract
Background Mental disorders, one of the leading causes of the global health-related burden, which has been exacerbated by the emergence of the COVID-19 pandemic. I...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

