Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

An Information-Theoretic Analysis of Targeted Regressions during Reading

View through CrossRef
Regressions, or backward saccades, are common during reading, accounting for between 5% and 20% of all saccades. And yet, relatively little is known about what causes them. We provide an information-theoretic operationalization for two previous qualitative hypotheses about regressions, which we dub reactivation and reanalysis. We argue that these hypotheses make different predictions about the pointwise mutual information or pmi between a regression’s source and target. Intuitively, the pmi between two words measures how much more (or less) likely one word is to be present given the other. On one hand, the reactivation hypothesis predicts that regressions occur between words that are associated, implying high positive values of pmi. On the other hand, the reanalysis hypothesis predicts that regressions should occur between words that are disassociated with each other, implying negative, low values of pmi. As a second theoretical contribution, we expand on previous theories by considering not only pmi but also expected values of pmi, E[pmi], where the expectation is taken over all possible realizations of the regression’s target. The rationale for this is that language processing involves making inferences under uncertainty, and readers may be uncertain about what they have read, especially if a previous word was skipped. To test both theories, we use contemporary language models to estimate pmi-based statistics over word pairs in three corpora of eye tracking data in English, as well as in six languages across three language families (Indo-European, Uralic, and Turkic). Our results are consistent across languages and models tested: Positive values of pmi and E[pmi] consistently help to predict the patterns of regressions during reading, whereas negative values of pmi and E[pmi] do not. Our information-theoretic interpretation increases the predictive scope of both theories and our studies present the first systematic crosslinguistic analysis of regressions in the literature. Our results support the reactivation hypothesis and, more broadly, they expand the number of language processing behaviors that can be linked to information-theoretic principles.
Title: An Information-Theoretic Analysis of Targeted Regressions during Reading
Description:
Regressions, or backward saccades, are common during reading, accounting for between 5% and 20% of all saccades.
And yet, relatively little is known about what causes them.
We provide an information-theoretic operationalization for two previous qualitative hypotheses about regressions, which we dub reactivation and reanalysis.
We argue that these hypotheses make different predictions about the pointwise mutual information or pmi between a regression’s source and target.
Intuitively, the pmi between two words measures how much more (or less) likely one word is to be present given the other.
On one hand, the reactivation hypothesis predicts that regressions occur between words that are associated, implying high positive values of pmi.
On the other hand, the reanalysis hypothesis predicts that regressions should occur between words that are disassociated with each other, implying negative, low values of pmi.
As a second theoretical contribution, we expand on previous theories by considering not only pmi but also expected values of pmi, E[pmi], where the expectation is taken over all possible realizations of the regression’s target.
The rationale for this is that language processing involves making inferences under uncertainty, and readers may be uncertain about what they have read, especially if a previous word was skipped.
To test both theories, we use contemporary language models to estimate pmi-based statistics over word pairs in three corpora of eye tracking data in English, as well as in six languages across three language families (Indo-European, Uralic, and Turkic).
Our results are consistent across languages and models tested: Positive values of pmi and E[pmi] consistently help to predict the patterns of regressions during reading, whereas negative values of pmi and E[pmi] do not.
Our information-theoretic interpretation increases the predictive scope of both theories and our studies present the first systematic crosslinguistic analysis of regressions in the literature.
Our results support the reactivation hypothesis and, more broadly, they expand the number of language processing behaviors that can be linked to information-theoretic principles.

Related Results

Incidental Collocation Learning from Different Modes of Input and Factors That Affect Learning
Incidental Collocation Learning from Different Modes of Input and Factors That Affect Learning
Collocations, i.e., words that habitually co-occur in texts (e.g., strong coffee, heavy smoker), are ubiquitous in language and thus crucial for second/foreign language (L2) learne...
Upaya Guru dalam Meningkatkan Minat Membaca Anak pada Masa Adaptasi Kebiasaan Baru di BMBA AIUEO Batujajar Bandung
Upaya Guru dalam Meningkatkan Minat Membaca Anak pada Masa Adaptasi Kebiasaan Baru di BMBA AIUEO Batujajar Bandung
Abstract. Based on the PISA report which was just released 2019, Indonesia's reading score is ranked 72 out of 77 countries (liputan6.com,2019). This condition shows the poor inter...
Understanding Reading Development: The Interplay of Fluency, Engagement, and Reading Anxiety in Early Grades
Understanding Reading Development: The Interplay of Fluency, Engagement, and Reading Anxiety in Early Grades
BackgroundReading achievement is positively associated with reading engagement; however, reading anxiety may undermine this relationship by reinforcing avoidance behaviors and redu...
Fostering reading-culture of pre-teen community friends via reading play dates
Fostering reading-culture of pre-teen community friends via reading play dates
Background: Partly because of the lack of a culture of reading in many households, poor reading achievement remains a pressing, worsening problem in South Africa, which this study ...
The Russian schoolchildren's digital reading: Factors affecting medium preferences and self-evaluation of digital reading practice
The Russian schoolchildren's digital reading: Factors affecting medium preferences and self-evaluation of digital reading practice
Introduction. While the importance of digital reading in modern education is constantly increasing, there are some knowledge gaps in investigating reading patterns (reading digital...
Branje mladih leta 2009 in leta 2018 ter razlike v branju glede na spol in izobraževalni program
Branje mladih leta 2009 in leta 2018 ter razlike v branju glede na spol in izobraževalni program
Reading of the youth in 2009 and 2018: differences according to gender and national study programme The article presents reading characteristics of 15-year olds, as assessed in the...
Reading Culture in the Digital World
Reading Culture in the Digital World
The purpose of this study is to explore and to identify the existing problems of modern reading and to suggest possible solutions. The study focuses on the role of different forms ...
Oral reading miscues by intermediate Chinese as second language readers
Oral reading miscues by intermediate Chinese as second language readers
Reading is a universal skill, but it is also adapted to different written languages. Chinese writing lacks sound-to-spelling correspondence, visible word boundaries, and overt infl...

Back to Top