
Fine-Tuning Large Language Models for Saudi Arabic Voice Agents

Voice-focused technologies are growing in Saudi Arabia in areas such as innovative city solutions, government services, healthcare, and finance, all of which rely on voice-assisted search and navigation. These applications need voice agents that are linguistically accurate and clear and that understand Saudi Arabian culture. Earlier natural language processing (NLP) systems, usually trained on Modern Standard Arabic (MSA) or on generalized dialect corpora, could not reliably represent the regional, phonetic, and pragmatic peculiarities of Saudi Arabic across its markedly different Najdi, Hijazi, Gulf, and Southern dialects. This paper discusses fine-tuning large language models (LLMs) to build effective spoken-dialog systems tailored to Saudi users. The approach combines intensive data collection from multiple Saudi Arabic sources, dialect labeling, acoustic feature preprocessing, and multi-stage fine-tuning of transformer-based models. Training is hybrid, using both textual and audio data, and performance is assessed with automatic measures (e.g., WER, BLEU) and with expert human evaluation of trustworthiness, fluency, and sociocultural compatibility. The results show that fine-tuned models can achieve far greater accuracy than baseline MSA or generic Arabic models in specific application domains such as e-government services, religious travel agencies, and healthcare triage systems. The paper also discusses the key ethical and practical issues involved: fairness of dialect representation, privacy of voice data, and sociolinguistic bias. Beyond being usable, voice agents need cultural competence if they are to be inclusive and digitally equitable.
This work presents a language-aware framework to support regional language learning options in Saudi Arabia. The framework also provides a scalable blueprint for localizing LLMs in underrepresented linguistic contexts, and it is likely to contribute to part of the Saudi Arabian government's overall AI and digital transformation vision as outlined in Vision 2030.
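
The abstract names WER (word error rate) as one of its automatic measures but does not say how it was computed. As an illustration only, the standard definition is the word-level edit distance between a reference transcript and a system hypothesis, divided by the number of reference words. The sketch below assumes whitespace tokenization and no text normalization; the function name `word_error_rate` and the placeholder tokens are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: tokens are separated by whitespace and compared exactly.
    Real Arabic evaluation would normally normalize text first (for example
    diacritics and letter variants); that step is omitted here.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Placeholder tokens: one substitution (b -> x) and one deletion (d)
# against a four-word reference.
print(word_error_rate("a b c d", "a x c"))  # 0.5
```

Because the denominator is the reference length, WER can exceed 1.0 when the hypothesis contains many inserted words, which is worth keeping in mind when comparing scores across models.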
Auricle Global Society of Education and Research
