The Rediscovery Hypothesis: Language Models Need to Meet Linguistics
There is an ongoing debate in the NLP community over whether modern language models contain linguistic knowledge that can be recovered through so-called probes. In this paper, we study whether linguistic knowledge is a necessary condition for the good performance of modern language models, a claim we call the rediscovery hypothesis.
First, we show that language models that are significantly compressed but still perform well on their pretraining objectives retain good scores when probed for linguistic structures. This result supports the rediscovery hypothesis and leads to the second contribution of our paper: an information-theoretic framework that relates language modeling objectives to linguistic information. This framework also provides a metric for measuring the impact of linguistic information on the word prediction task. We reinforce our analytical results with various experiments on both synthetic and real NLP tasks in English.
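For readers unfamiliar with probing, the following is a minimal sketch of the general idea, not the setup used in the paper. It assumes synthetic "representations" (random vectors standing in for frozen language-model states) in which a hypothetical binary linguistic tag is linearly encoded, and it fits a logistic-regression probe on top of them. A control probe trained on random labels gives a chance-level baseline. All names (`train_linear_probe`, `probe_accuracy`, `direction`) and all hyperparameters are illustrative assumptions.

```python
import numpy as np


def train_linear_probe(reps, labels, lr=0.5, steps=500):
    """Fit a logistic-regression probe on frozen representations
    using plain batch gradient descent."""
    n, d = reps.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(steps):
        logits = reps @ w + b
        probs = 1.0 / (1.0 + np.exp(-logits))
        grad = probs - labels  # gradient of the mean log-loss w.r.t. logits
        w -= lr * (reps.T @ grad) / n
        b -= lr * grad.mean()
    return w, b


def probe_accuracy(reps, labels, w, b):
    """Fraction of examples whose tag the probe predicts correctly."""
    preds = (reps @ w + b > 0).astype(int)
    return float((preds == labels).mean())


rng = np.random.default_rng(0)

# Assumption: stand-in for frozen model states (2000 tokens, 16 dimensions).
reps = rng.normal(size=(2000, 16))

# Assumption: a synthetic binary "linguistic tag" encoded along one direction.
direction = rng.normal(size=16)
tags = (reps @ direction > 0).astype(int)

# Control task: labels unrelated to the representations.
control = rng.integers(0, 2, size=2000)

train, test = slice(0, 1500), slice(1500, 2000)

w, b = train_linear_probe(reps[train], tags[train])
tag_acc = probe_accuracy(reps[test], tags[test], w, b)

w_c, b_c = train_linear_probe(reps[train], control[train])
control_acc = probe_accuracy(reps[test], control[test], w_c, b_c)

print(f"probe accuracy on encoded tag: {tag_acc:.2f}")
print(f"probe accuracy on random control labels: {control_acc:.2f}")
```

A high probe score next to a chance-level control score is the kind of evidence usually read as "the representation encodes this property". The sketch says nothing about compression or about the information-theoretic metric described above.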

