Javascript must be enabled to continue!

Unsupervised text segmentation predicts eye ﬁxations during reading

Words typically form the basis of psycholinguistic and computational linguistic studies about sentence processing. However, recent evidence shows the basic units during reading, i.e., the items in the mental lexicon, are not always words, but could also be sub-word and supra-word units. To recognize these units, human readers require a cognitive mechanism to learn and detect them. In this paper, we assume eye fixations during reading reveal the locations of the cognitive units, and that the cognitive units are analogous with the text units discovered by unsupervised segmentation models. We predict eye fixations by model-segmented units on both English and Dutch text. The results show the model-segmented units predict eye fixations better than word units. This finding suggests that the predictive performance of model-segmented units indicates their plausibility as cognitive units. The Less-is-Better (LiB) model, which finds the units that minimize both long-term and working memory load, offers advantages both in terms of prediction score and efficiency among alternative models. Our results also suggest that modeling the least-effort principle on the management of long-term and working memory can lead to inferring cognitive units. Overall, the study supports the theory that the mental lexicon stores not only words but also smaller and larger units, suggests that fixation locations during reading depend on these units, and shows that unsupervised segmentation models can discover these units.

Center for Open Science

Jinbiao Yang Antal van den Bosch Stefan L. Frank

2021

Title: Unsupervised text segmentation predicts eye ﬁxations during reading

Description:

Words typically form the basis of psycholinguistic and computational linguistic studies about sentence processing.

However, recent evidence shows the basic units during reading, i.

, the items in the mental lexicon, are not always words, but could also be sub-word and supra-word units.

To recognize these units, human readers require a cognitive mechanism to learn and detect them.

In this paper, we assume eye fixations during reading reveal the locations of the cognitive units, and that the cognitive units are analogous with the text units discovered by unsupervised segmentation models.

We predict eye fixations by model-segmented units on both English and Dutch text.

The results show the model-segmented units predict eye fixations better than word units.

This finding suggests that the predictive performance of model-segmented units indicates their plausibility as cognitive units.

The Less-is-Better (LiB) model, which finds the units that minimize both long-term and working memory load, offers advantages both in terms of prediction score and efficiency among alternative models.

Our results also suggest that modeling the least-effort principle on the management of long-term and working memory can lead to inferring cognitive units.

Overall, the study supports the theory that the mental lexicon stores not only words but also smaller and larger units, suggests that fixation locations during reading depend on these units, and shows that unsupervised segmentation models can discover these units.

Back

<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Sleep Habits and Occurrence of Lowback Pain among Craftsmen

<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Bounds on the sum of broadcast domination number and strong metric dimension of graphs

Let [Formula: see text] be a connected graph of order at least two with vertex set [Formula: see text]. For [Formula: see text], let [Formula: see text] denote the length of an [Fo...

ANALYSIS OF READING MATERIALS IN TEXTBOOK FOR GRADE XI SENIOR HIGH SCHOOL

This study aims to find out the GI and LD level, the text which has the highest GI and LD and what make the text has the highest GI and LD of Advanced Learning English 2 textbook. ...

Incidental Collocation Learning from Different Modes of Input and Factors That Affect Learning

Collocations, i.e., words that habitually co-occur in texts (e.g., strong coffee, heavy smoker), are ubiquitous in language and thus crucial for second/foreign language (L2) learne...

Multiple surface segmentation using novel deep learning and graph based methods

<p>The task of automatically segmenting 3-D surfaces representing object boundaries is important in quantitative analysis of volumetric images, which plays a vital role in nu...

AI‐enabled precise brain tumor segmentation by integrating Refinenet and contour‐constrained features in MRI images

AbstractBackgroundMedical image segmentation is a fundamental task in medical image analysis and has been widely applied in multiple medical fields. The latest transformer‐based de...

Upaya Guru dalam Meningkatkan Minat Membaca Anak pada Masa Adaptasi Kebiasaan Baru di BMBA AIUEO Batujajar Bandung

Abstract. Based on the PISA report which was just released 2019, Indonesia's reading score is ranked 72 out of 77 countries (liputan6.com,2019). This condition shows the poor inter...

Email:
Password:

Email:

Unsupervised text segmentation predicts eye ﬁxations during reading

Related Results