Automatic quantification of lexical ambiguity using large-scale word association data

Most words in a language are lexically ambiguous and are associated with multiple meanings that vary in their frequency and relatedness. Although ambiguity is a fundamental property of language, there are extensive issues with existing measures of this construct. For instance, dictionary-based classifications and subjective ratings of meaning number and frequency struggle to capture the graded nature of ambiguity and, by proxy, its impact on cognition and performance in experimental tasks. It is also difficult to scale subjective measures to the full lexicon. We introduce a novel, automated framework to measure lexical ambiguity based on word association data from the Small World of Words (SWOW) project. We apply community detection algorithms to association graphs to quantify both the number and distribution of semantic communities for each word. This in turn allows us to derive graded representations of meaning frequency and relatedness. To better understand our new metrics, we compare them to previously published subjective norms, and establish their validity by showing that they predict lexical decision performance in English and Rioplatense Spanish. Furthermore, our results reveal cross-linguistic differences in lexical ambiguity—Spanish is less ambiguous than English overall—which we hypothesize is due to typological differences between the languages. Our validated framework contributes novel insights for computational and psycholinguistic models of semantic processing, and offers a scalable, automated, and language-independent framework for quantifying different facets of lexical ambiguity. We provide all of our code and ambiguity measures for approximately 7000 words in both languages to facilitate their use by other researchers.
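The community-based idea described above can be illustrated with a minimal sketch. The abstract does not name the community detection algorithm, the graph construction, or the exact metrics, so everything here is an assumption: the toy association data for the cue "bank" is made up (not taken from SWOW), a simple deterministic label propagation stands in for whatever algorithm the authors used, and the helper names `label_propagation` and `ambiguity_profile` are hypothetical. The sketch counts the semantic communities among a cue's associates and summarizes how association strength is spread across them using Shannon entropy.

```python
import math
from collections import defaultdict

# Toy, made-up data (assumption): association strengths from the cue "bank"
# to its associates, and undirected links among the associates themselves.
CUE_STRENGTH = {"money": 0.4, "loan": 0.1, "cash": 0.1,
                "river": 0.2, "water": 0.1, "shore": 0.1}
ASSOCIATE_EDGES = [("money", "loan"), ("money", "cash"), ("loan", "cash"),
                   ("river", "water"), ("river", "shore"), ("water", "shore")]


def label_propagation(nodes, edges, max_iters=100):
    """Deterministic label propagation; returns {node: community label}."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    labels = {n: n for n in nodes}  # every node starts in its own community
    for _ in range(max_iters):
        changed = False
        for node in sorted(nodes):
            counts = defaultdict(int)
            for nb in neighbors[node]:
                counts[labels[nb]] += 1
            if not counts:
                continue  # an isolated node keeps its own label
            # Most frequent neighbor label; ties go to the alphabetically first.
            best = min(counts, key=lambda lab: (-counts[lab], lab))
            if best != labels[node]:
                labels[node] = best
                changed = True
        if not changed:
            break
    return labels


def ambiguity_profile(cue_strength, edges):
    """Return (community count, strength share per community, entropy in bits)."""
    labels = label_propagation(list(cue_strength), edges)
    share = defaultdict(float)
    for word, strength in cue_strength.items():
        share[labels[word]] += strength
    total = sum(share.values())
    shares = sorted((s / total for s in share.values()), reverse=True)
    entropy = -sum(p * math.log2(p) for p in shares if p > 0)
    return len(shares), shares, entropy


if __name__ == "__main__":
    n, shares, h = ambiguity_profile(CUE_STRENGTH, ASSOCIATE_EDGES)
    print(n, [round(s, 2) for s in shares], round(h, 3))
```

In this sketch, an entropy near 0 would indicate one dominant community (a word with a single prevailing meaning), while an entropy near log2 of the community count would indicate meanings of roughly equal frequency.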

Related Results

Spoken Word Recognition
The core question that spoken word recognition research attempts to address is: How does a phonological word-form activate the corresponding lexical representation that is stored i...
Classification of Bisyllabic Lexical Stress Patterns Using Deep Neural Networks
Background and Objectives: As English is a stress-timed language, lexical stress plays an important role in the perception and processing of speech by native speakers. Incorrect st...
LEXICAL AND SYNTACTIC AMBIGUITY IN HUMOR
Ambiguity occurs when a sentence has more than one meaning. Ambiguity can be caused by the ambiguous lexicon in which one word has more than one meaning and it can also be caused b...
ARABIC POLYSEMY
The word as a spiritual unit has lexical and grammatical semantics. Lexicology considers a word as a unit of the lexical content of a language, making lexical semantics, the basic ...
Are Cervical Ribs Indicators of Childhood Cancer? A Narrative Review
Abstract A cervical rib (CR), also known as a supernumerary or extra rib, is an additional rib that forms above the first rib, resulting from the overgrowth of the transverse proce...
Lexical Richness of Chinese College Students’ Spoken English
Lexical richness has been considered one of the most effective methods of assessing writing proficiency. However, the studies on spoken English lexical richness for EFL Chinese stu...
Overcoming lexical interference in Chinese students learning Russian
Background. The article addresses the issue of lexical interference among Chinese students learning Russian as a foreign language. This phenomenon is due to significant differences...
Lexical Complexity of IELTS Academic Writing Task 2 Model Answers at Band Score 6
This study examined the lexical complexity of IELTS Academic Writing Task 2 model answers rated at Band 6 from IELTS preparation books, aiming to provide practical insights for edu...
