Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data

View through CrossRef
The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup. Understandably, it is a practical problem that the performance degrades due to accidental displacement of the microphone from the correct position through routine use and maintenance. In this paper, an approach is formulated which makes it possible to determine the optimal position of the microphone, given the geometry of the room, the location of the speaker, and the inexact location of the microphone. The metric used to measure the correctness of the microphone position is the average character error rate (cer) of the ASR system at that position. The proposed approach uses limited available data which is usually the case when dealing with microphone position error analysis while considering room acoustics. A neural network classifier is trained using limited data to associate the location of the microphone with the corresponding class of cer values. The obtained prediction is used to trace the microphone back to the optimal position.
Institute of Electrical and Electronics Engineers (IEEE)
Title: Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
Description:
The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup.
Understandably, it is a practical problem that the performance degrades due to accidental displacement of the microphone from the correct position through routine use and maintenance.
In this paper, an approach is formulated which makes it possible to determine the optimal position of the microphone, given the geometry of the room, the location of the speaker, and the inexact location of the microphone.
The metric used to measure the correctness of the microphone position is the average character error rate (cer) of the ASR system at that position.
The proposed approach uses limited available data which is usually the case when dealing with microphone position error analysis while considering room acoustics.
A neural network classifier is trained using limited data to associate the location of the microphone with the corresponding class of cer values.
The obtained prediction is used to trace the microphone back to the optimal position.

Related Results

Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
<p>The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup. Understand...
Under-Sampled Microphone Data as a Wind Measurement
Under-Sampled Microphone Data as a Wind Measurement
It is common human experience that wind is associated with noise. Acoustic noise is generated by the turbulent pressure fluctuations  associated with the shearing flow near the gro...
Revise and remember : managing hip resurfacing implant recall
Revise and remember : managing hip resurfacing implant recall
<h4>Background</h4><p dir="ltr">Total hip arthroplasty (THA) with metal-on-polyethylene articulation provides excellent clinical outcomes but remains associated w...
Revise and remember : managing hip resurfacing implant recall
Revise and remember : managing hip resurfacing implant recall
<h4>Background</h4><p dir="ltr">Total hip arthroplasty (THA) with metal-on-polyethylene articulation provides excellent clinical outcomes but remains associated w...
Artifact Subspace Reconstruction (ASR) for electroencephalography artifact removal must be optimized for each unique dataset
Artifact Subspace Reconstruction (ASR) for electroencephalography artifact removal must be optimized for each unique dataset
Artifact subspace reconstruction (ASR) is an automatic artifact reject method that can effectively remove transient or large-amplitude artifacts found in electroencephalographic (E...
Characterization of Micromachined Seesaw Type Microphone
Characterization of Micromachined Seesaw Type Microphone
A seesaw type microphone is characterized to study its feasibility for sound source localization. The microphone is composed of a flexible rectangular diaphragm sustained by two to...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Back to Top