Transliteration-Aided Transfer Learning for Low-Resource ASR: A Case Study on Khalkha Mongolian
Automatic Speech Recognition (ASR) systems have advanced steadily, with notable improvements in state-of-the-art performance across many languages. However, their effectiveness often declines significantly in low-resource settings, where data and linguistic resources are limited. This paper addresses the challenges of ASR for a low-resource language, Khalkha Mongolian, with a transliteration-aided transfer learning approach. Specifically, the text of a well-resourced Chakhar Mongolian (Uighur script) dataset is transliterated into the Cyrillic script, and the model is then fine-tuned on Khalkha Mongolian data. The method was validated on three popular ASR models: Wav2Vec2-BERT, Conformer-Large, and Whisper-large-v3. Among these models, the best relative improvement in word error rate (WER) reaches 32.50%, while the best absolute improvement reaches 19.26%.
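The abstract reports both a relative and an absolute WER improvement, and the two are easy to confuse. The following is a minimal sketch of how these quantities are commonly computed, assuming the standard word-level edit-distance definition of WER. The function names and the example numbers are illustrative assumptions, not taken from the paper, and the abstract does not state the baseline WERs behind its figures.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token lists (substitutions,
    insertions, and deletions all cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance divided by reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


def absolute_improvement(baseline_wer, new_wer):
    """Difference in WER, in the same units as the inputs."""
    return baseline_wer - new_wer


def relative_improvement(baseline_wer, new_wer):
    """WER reduction expressed as a fraction of the baseline WER."""
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical numbers for illustration only, not results from the paper.
    baseline, improved = 0.40, 0.30
    print(absolute_improvement(baseline, improved))
    print(relative_improvement(baseline, improved))
```

A given absolute reduction corresponds to a larger relative reduction when the baseline WER is lower, so the best relative and best absolute figures need not come from the same model.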
Related Results
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstract
Introduction
Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
On the Study of Mongolian Script Lexicography
In this article, the history of lexicography of Mongolian linguistics, including the lexicography of Mongolian writing, is discussed. Mongolian linguistics has a rich history of le...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract
Introduction
Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
Global burden of mental disorders in 204 countries and territories, 1990 - 2021: results from the global burden of disease study 2021
Abstract
Background: Mental disorders are one of the leading causes of the global health-related burden, which has been exacerbated by the emergence of the COVID-19 pandemic. I...
Results of Compliance Test for Determining the Mongolian Script Knowledge and Skills of Civil Servants
In the framework of conducting the preparatory work for the implementation of the medium and long-term planning of the official transition to the use of the Mongolian script, the r...
Bridging Language Gaps: A Dive Into Cross-Lingual Named Entity Transliteration in Chinese
Language is a fundamental component of culture and identity. The transliteration of language names into Chinese, a complex task requiring a deep understanding of both ling...
Improving the Ability to Write Nameboards Using the Balinese Script Transliteration Application among Students of SMA Negeri 1 Tegallalang
This study aims to help students improve their ability to write nameboards in Balinese script using the Balinese Script Transliteration Application as a medium, for students of class XII I...
Mongolian Buddhist Scholars’ Works on Infectious Diseases (Late 17th Century to the Beginning of the 20th Century)
The Qing period saw both the flowering of Buddhism in Mongolia as well as the arrival of new infectious diseases such as smallpox and syphilis which had reached epidemic levels by ...

