Why do human languages have homophones?
Human languages are replete with ambiguity. This is most evident in homophony, where two or more words sound the same but carry distinct meanings. For example, the wordform “bark” can denote either the sound produced by a dog or the protective outer sheath of a tree trunk. Why would a system evolved for efficient, effective communication display rampant ambiguity? Some accounts argue that ambiguity is actually a design feature of human communication systems, allowing languages to recycle their optimal wordforms (those which are short, frequent, and phonotactically well-formed) for multiple meanings. We test this claim by constructing five series of artificial lexica matched for the phonotactics and distribution of word lengths found in five real languages (English, German, Dutch, French, and Japanese), and comparing both the quantity and concentration of homophony across the real and artificial lexica. Surprisingly, we find that the artificial lexica exhibit higher upper bounds on homophony than their real counterparts, and that homophony is even more likely to be found among short, phonotactically plausible wordforms in the artificial lexica than in the real ones. These results suggest that homophony in real languages is not directly selected for, but rather emerges as a natural consequence of other features of a language. In fact, homophony may even be selected against in real languages, producing lexica that better conform to other requirements of the humans who need to use them. Finally, we explore the hypothesis that this is achieved by “smoothing” out dense concentrations of homophones across lexical neighborhoods, resulting in comparatively more minimal pairs in real lexica.
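The abstract does not say which generative model was used to build the artificial lexica, so the following is only a minimal sketch of the general idea, under stated assumptions: a character bigram model stands in for the phonotactic model, orthographic characters stand in for phonemes, and homophony is counted as the number of extra meanings that share a wordform. All function names here (`train_bigram`, `sample_word`, `artificial_lexicon`, `homophone_count`) are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter, defaultdict

# Boundary markers; assumed not to occur inside any wordform.
START, END = "^", "$"

def train_bigram(words):
    """Count symbol-to-symbol transitions, including word boundaries."""
    counts = defaultdict(Counter)
    for w in words:
        symbols = [START, *w, END]
        for a, b in zip(symbols, symbols[1:]):
            counts[a][b] += 1
    return counts

def sample_once(counts, rng, max_len):
    """Sample one wordform; return None if it grows past max_len symbols."""
    out, cur = [], START
    while len(out) <= max_len:
        successors = counts[cur]
        cur = rng.choices(list(successors), weights=list(successors.values()))[0]
        if cur == END:
            return "".join(out)
        out.append(cur)
    return None

def sample_word(counts, length, rng, max_tries=10_000):
    """Rejection-sample until a wordform of exactly `length` symbols appears."""
    for _ in range(max_tries):
        word = sample_once(counts, rng, length)
        if word is not None and len(word) == length:
            return word
    raise RuntimeError(f"no wordform of length {length} found")

def artificial_lexicon(real, rng):
    """One sampled wordform per meaning, matching the real length distribution."""
    counts = train_bigram(real)
    return [sample_word(counts, len(w), rng) for w in real]

def homophone_count(lexicon):
    """Number of meanings beyond the first that share an existing wordform."""
    return sum(c - 1 for c in Counter(lexicon).values())

# Toy "real" lexicon: one entry per meaning, so "bark" appears twice.
real = ["bark", "bark", "bat", "tab", "rat", "art", "at", "tar"]
fake = artificial_lexicon(real, random.Random(0))
print(homophone_count(real), homophone_count(fake))
```

Because wordforms are sampled independently for each meaning, short and highly probable forms are drawn repeatedly, which is how homophony can arise in such a baseline without any pressure favoring it. Repeating the sampling over many seeds would give a distribution of homophone counts to compare against the real lexicon.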
Related Results
Kra-Dai Languages
Kra-Dai (also called Tai-Kadai and Kam-Tai) is a family of approximately 100 languages spoken in Southeast Asia, extending from the island of Hainan, China, in the east to the Indi...
Mande Languages
Mande is a mid-range language family in Western Sub-Saharan Africa that includes 60 to 75 languages spoken by 30 to 40 million people. According to the glottochronological data, it...
Khoisan Languages
The languages traditionally referred to as “Khoisan” languages are spoken in southern and eastern Africa, specifically in the Republic of South Africa, Namibia, Botswana, Angola, a...
Perbandingan Kosa Kata Antara Bahasa Dentong dan Bahasa Duri (Sebuah Tinjauan Linguistik)
The problems of this research are (1) the relationship of similarities and similarities in the vocabulary of Dentong and Duri languages (2) the relationship between sound and mea...
Sound and meaning: On the duration of Japanese homophones
It is by now well known that the relation between sound and meaning is much less arbitrary than in the original conception of de Saussure. Yet, whether meaning can directly influen...
Mayan Languages
Mayan languages are spoken by over 5 million people in Guatemala, Mexico, Belize, and Honduras. There are around 30 different languages today, ranging in size from fairly large (ab...
Jewish Languages
Wherever Jews have lived, they have tended to speak and write somewhat differently from their non-Jewish neighbors. In some cases these differences have been limited to the additio...
Language Mapping of the Cordillera Administrative Region Using Relational Model
Purpose–Various studies have already done the language mapping of the different languages of the Philippines, though it only consists of the most pop...

