Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Mediating effects of NLP-based parameters on the readability of crowdsourced wikipedia articles

View through CrossRef
AbstractIn this era of information and communication technology, a large population relies on the Internet to gather information. One of the most popular information sources on the Internet is Wikipedia. Wikipedia is a free encyclopedia that provides a wide range of information to its users. However, there have been concerns about the readability of information on Wikipedia time and again. The readability of the text is defined as the ease of understanding the underlying text. Past studies have analyzed the readability of Wikipedia articles with the help of conventional readability metrics, such as the Flesch-Kincaid readability score and the Automatic Readability Index (ARI). Such metrics only consider the surface-level parameters, such as the number of words, sentences, and paragraphs in the text, to quantify the readability. However, the readability of the text must also take into account the quality of the text. In this study, we consider many new NLP-based parameters capturing the quality of the text, such as lexical diversity, semantic diversity, lexical complexity, and semantic complexity and analyze their impact on the readability of Wikipedia articles using artificial neural networks. Besides NLP parameters, the crowdsourced parameters also affect the readability, and therefore, we also analyze the impact of crowdsourced parameters and observe that the crowdsourced parameters not only influence the readability scores but also affect the NLP parameters of the text. Additionally, we investigate the mediating effect of NLP parameters that connect the crowdsourced parameters to the readability of the text. The results show that the impact of crowdsourced parameters on readability is partially due to the profound effect of NLP-based parameters.
Title: Mediating effects of NLP-based parameters on the readability of crowdsourced wikipedia articles
Description:
AbstractIn this era of information and communication technology, a large population relies on the Internet to gather information.
One of the most popular information sources on the Internet is Wikipedia.
Wikipedia is a free encyclopedia that provides a wide range of information to its users.
However, there have been concerns about the readability of information on Wikipedia time and again.
The readability of the text is defined as the ease of understanding the underlying text.
Past studies have analyzed the readability of Wikipedia articles with the help of conventional readability metrics, such as the Flesch-Kincaid readability score and the Automatic Readability Index (ARI).
Such metrics only consider the surface-level parameters, such as the number of words, sentences, and paragraphs in the text, to quantify the readability.
However, the readability of the text must also take into account the quality of the text.
In this study, we consider many new NLP-based parameters capturing the quality of the text, such as lexical diversity, semantic diversity, lexical complexity, and semantic complexity and analyze their impact on the readability of Wikipedia articles using artificial neural networks.
Besides NLP parameters, the crowdsourced parameters also affect the readability, and therefore, we also analyze the impact of crowdsourced parameters and observe that the crowdsourced parameters not only influence the readability scores but also affect the NLP parameters of the text.
Additionally, we investigate the mediating effect of NLP parameters that connect the crowdsourced parameters to the readability of the text.
The results show that the impact of crowdsourced parameters on readability is partially due to the profound effect of NLP-based parameters.

Related Results

Wikipedia in Vascular Surgery Medical Education: Comparative Study (Preprint)
Wikipedia in Vascular Surgery Medical Education: Comparative Study (Preprint)
BACKGROUND Medical students commonly refer to Wikipedia as their preferred online resource for medical information. The quality and readability of articles ...
Accuracy and readability of cardiovascular entries on Wikipedia: are they reliable learning resources for medical students?
Accuracy and readability of cardiovascular entries on Wikipedia: are they reliable learning resources for medical students?
Objective To evaluate accuracy of content and readability level of English Wikipedia articles on cardiovascular diseases, using quality and readability tools. ...
AI and Incidental Findings
AI and Incidental Findings
Photo by Accuray on Unsplash INTRODUCTION Delayed and missed follow-up on incidental findings threatens patient health and is a major financial risk for healthcare systems. The hea...
Exploiting Wikipedia Semantics for Computing Word Associations
Exploiting Wikipedia Semantics for Computing Word Associations
<p><b>Semantic association computation is the process of automatically quantifying the strength of a semantic connection between two textual units based on various lexi...
Wikipedia: a tool to monitor seasonal diseases trends?
Wikipedia: a tool to monitor seasonal diseases trends?
ObjectiveTo explore the interest of Wikipedia as a data source to monitorseasonal diseases trends in metropolitan France.IntroductionToday, Internet, especially Wikipedia, is an im...
TINGKAT KETERBACAAN BUKU TEKS BAHASA INDONESIA KURIKULUM MERDEKA UNTUK KELAS X SMA/SMK
TINGKAT KETERBACAAN BUKU TEKS BAHASA INDONESIA KURIKULUM MERDEKA UNTUK KELAS X SMA/SMK
The ever-changing curriculum means that teaching materials, mainly textbooks, also continue to change. This change is not offset by the quality of textbooks discourse, which still ...
A Critical Review of Research on Ethics of Crowdsourced Translation at Home and Abroad (2009–2024)
A Critical Review of Research on Ethics of Crowdsourced Translation at Home and Abroad (2009–2024)
With the rapid development of global crowdsourced translation practices, research on the phenomenon’s multifaceted ethical dimensions has steadily progressed. This paper presents t...
COVID-19 research in Wikipedia
COVID-19 research in Wikipedia
Wikipedia is one of the main sources of free knowledge on the Web. During the first few months of the pandemic, over 5,200 new Wikipedia pages on COVID-19 were created, accumulatin...

Back to Top