
Reducing Hallucination in Multilingual Voice Agents Using Instruction-Tuned Models

In widely deployed multilingual voice agents for customer service and interactive AI systems, one problem persists: hallucinations, that is, responses that are syntactically well formed but logically wrong or simply irrelevant. The problem is amplified in multilingual settings by divergent linguistic features, domain-specific vocabularies, and underrepresented languages. This paper investigates how far hallucinations in multilingual voice agents can be reduced with instruction-tuned models, that is, models fine-tuned to follow explicit task guidelines. We fine-tune large multilingual transformer-based models (mT5, XGLM, and BLOOMZ) and evaluate them across ten languages on understanding, factual accuracy, and language consistency. We propose a hybrid evaluation framework that combines automated metrics (BLEU, COMET, and factual consistency scores) with human evaluation adapted to the cultural and linguistic characteristics of each language studied. Our experiments indicate that instruction tuning reduces hallucinations by a large margin, particularly in combination with retrieval-augmented generation (RAG) and task-completion instructions. We also discuss how the effect of instruction tuning differs between high-resource and low-resource languages and how this may affect hallucination across language families. The analysis can also be used to identify trends at the structural and syntactic levels. Finally, we suggest effective practices for calibrating instruction-based pipelines in multilingual voice systems at the operational stage.
Our findings suggest that the future success of multilingual voice assistants depends on adapting instruction-tuned models for greater factual reliability, cross-lingual consistency, and user trust, supporting safer and more responsible conversational AI agents across the language divide.
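
To make the combination of retrieval-augmented generation and task-completion instructions more concrete, the following is a minimal Python sketch of how a retrieval-grounded instruction prompt might be assembled. It is an illustration only, not the paper's method: the function name `build_grounded_prompt`, the prompt wording, and the sample passage are all assumptions introduced here, and no model is called.

```python
from typing import List


def build_grounded_prompt(question: str, passages: List[str], language: str) -> str:
    """Assemble an instruction-style prompt that restricts the answer to retrieved text.

    Hypothetical helper for illustration; the wording of the instruction is an assumption.
    """
    # Number the retrieved passages so each one can be referred to unambiguously.
    context = "\n".join(f"[{i}] {p}" for i, p in enumerate(passages, start=1))
    # Explicit task instruction: fix the response language and allow an "I do not know" answer.
    instruction = (
        f"Answer in {language}. Use only the numbered passages below. "
        "If they do not contain the answer, say that you do not know."
    )
    return f"{instruction}\n\nPassages:\n{context}\n\nQuestion: {question}\nAnswer:"


# Example input (made up for this sketch).
prompt = build_grounded_prompt(
    question="What are the store's opening hours?",
    passages=["The store is open 9:00-18:00, Monday to Saturday."],
    language="Spanish",
)
print(prompt)
```

The idea behind this design is that the instruction names the response language and gives the model an explicit alternative to guessing, while the retrieved passages supply the facts. Whether such a prompt reduces hallucination in practice depends on the model and the retrieval quality.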
Auricle Global Society of Education and Research

