Javascript must be enabled to continue!
Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model
View through CrossRef
Recent research indicates that pretraining cross-lingual language models on large-scale unlabeled texts yields significant performance improvements over various cross-lingual and low-resource tasks. Through training on one hundred languages and terabytes of texts, cross-lingual language models have proven to be effective in leveraging high-resource languages to enhance low-resource language processing and outperform monolingual models. In this paper, we further investigate the cross-lingual and cross-domain (CLCD) setting when a pretrained cross-lingual language model needs to adapt to new domains. Specifically, we propose a novel unsupervised feature decomposition method that can automatically extract domain-specific features and domain-invariant features from the entangled pretrained cross-lingual representations, given unlabeled raw texts in the source language. Our proposed model leverages mutual information estimation to decompose the representations computed by a cross-lingual model into domain-invariant and domain-specific parts. Experimental results show that our proposed method achieves significant performance improvements over the state-of-the-art pretrained cross-lingual language model in the CLCD setting.
International Joint Conferences on Artificial Intelligence Organization
Title: Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model
Description:
Recent research indicates that pretraining cross-lingual language models on large-scale unlabeled texts yields significant performance improvements over various cross-lingual and low-resource tasks.
Through training on one hundred languages and terabytes of texts, cross-lingual language models have proven to be effective in leveraging high-resource languages to enhance low-resource language processing and outperform monolingual models.
In this paper, we further investigate the cross-lingual and cross-domain (CLCD) setting when a pretrained cross-lingual language model needs to adapt to new domains.
Specifically, we propose a novel unsupervised feature decomposition method that can automatically extract domain-specific features and domain-invariant features from the entangled pretrained cross-lingual representations, given unlabeled raw texts in the source language.
Our proposed model leverages mutual information estimation to decompose the representations computed by a cross-lingual model into domain-invariant and domain-specific parts.
Experimental results show that our proposed method achieves significant performance improvements over the state-of-the-art pretrained cross-lingual language model in the CLCD setting.
Related Results
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Effects of Age and Gender during Three Lingual Tasks on Peak Lingual Pressures in Healthy Adults
Effects of Age and Gender during Three Lingual Tasks on Peak Lingual Pressures in Healthy Adults
Purpose: This study examined the effects of age and gender during three intra-oral lingual tasks (elevation, protrusion, and depression) on peak lingual pressure in healthy adults....
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...
Adaptive Planning for Resilient Coastal Waterfronts
Adaptive Planning for Resilient Coastal Waterfronts
Many delta and coastal cities worldwide face increasing flood risk due to changing climate conditions and sea level rise. The question is how to develop measures and strategies for...
PERILAKU SATUAN LINGUAL -(N)ING DALAM BAHASA JAWA (LINGUAL UNIT BEHAVIOR -(N)ING IN JAVANESE LANGUANGE)
PERILAKU SATUAN LINGUAL -(N)ING DALAM BAHASA JAWA (LINGUAL UNIT BEHAVIOR -(N)ING IN JAVANESE LANGUANGE)
Penelitian ini berjudul Perilaku Satuan Lingual (n)ing dalam Bahasa Jawa. Teori yang digunakan dalam kajian ini ialah kategori kata dan analisis konstituen. Pengumpulan data menggu...
RESEARCHING WRITTEN MONUMENTS IN THE CONTEXT OF CHANGING SCIENTIFIC PARADIGMS
RESEARCHING WRITTEN MONUMENTS IN THE CONTEXT OF CHANGING SCIENTIFIC PARADIGMS
The scientific paradigm of the 21st century has acquired anthropocentric drift. In modern linguistic studies, the anthropocentric approach also occupies a dominant position: the re...
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Abstract
Funding Acknowledgements
Type of funding sources: None.
INTRODUCTION Patients with heart failure (HF)...
Mental practice of lingual resistance and cortical plasticity in older adults: An
exploratory fNIRS study
Mental practice of lingual resistance and cortical plasticity in older adults: An
exploratory fNIRS study
Purpose: Mental practice using motor imagery (MP) improves motor strength and
coordination in the upper and lower extremities in clinical patient populations. Its
...

