
On the Statistical and Practical Limitations of Thurstonian IRT Models

Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been proposed. For convenient model specification, we introduce the thurstonianIRT R package, which uses Mplus, lavaan, and Stan for model estimation. Based on practical considerations, we establish that items within one block need to be equally keyed to achieve similar social desirability, which is essential for creating forced-choice questionnaires that have the potential to resist faking intentions. According to extensive simulations, measuring up to 5 traits using blocks of only equally keyed items does not yield sufficiently accurate trait scores and inter-trait correlation estimates, with either frequentist or Bayesian estimation methods. As a result, persons' trait scores remain partially ipsative and, thus, do not allow for valid comparisons between persons. However, we demonstrate that trait scores based on only equally keyed blocks can be improved substantially by measuring a sizeable number of traits. More specifically, in our simulations of 30 traits, scores based on only equally keyed blocks were non-ipsative and highly accurate. We conclude that in high-stakes situations where persons are motivated to give fake answers, Thurstonian IRT models should only be applied to tests measuring a sizeable number of traits.
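The ipsativity problem of classical scoring mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code and does not use the thurstonianIRT package; the function names, the rank-point scoring rule (k-1 points for the most preferred item in a block of k, 0 for the least), the design of four triplets over three traits, and the random responses are all assumptions chosen only for illustration, with every item treated as positively keyed.

```python
import random

def classical_scores(rankings, block_traits, n_traits):
    """Sum rank points per trait (hypothetical scoring rule).

    In a block of k items, the most preferred item earns k-1 points
    and the least preferred item earns 0 points.
    """
    scores = [0] * n_traits
    for ranking, traits in zip(rankings, block_traits):
        k = len(ranking)
        for position, item_index in enumerate(ranking):
            scores[traits[item_index]] += (k - 1) - position
    return scores

def simulate_person(block_traits, rng):
    """Draw an arbitrary ranking of the items in each block."""
    rankings = []
    for traits in block_traits:
        order = list(range(len(traits)))
        rng.shuffle(order)  # order[0] is the most preferred item
        rankings.append(order)
    return rankings

rng = random.Random(0)
n_traits = 3
# Assumed design: four triplets, each with one item per trait.
block_traits = [[0, 1, 2]] * 4

people = [
    classical_scores(simulate_person(block_traits, rng), block_traits, n_traits)
    for _ in range(5)
]
totals = [sum(person) for person in people]

for person, total in zip(people, totals):
    print(person, "sum =", total)
```

Under these assumptions each triplet always distributes 0 + 1 + 2 = 3 points, so every simulated person's trait scores sum to the same total regardless of how they respond. A higher score on one trait forces lower scores elsewhere, which is why such scores support comparisons within a person but not between persons.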

Related Results

Application of Alternative IRT models to IRT Assumption Violation
The purpose of this study is to investigate the most appropriate alternative IRT parameter estimation models among bi-factor model, testlet based model, and second-order IRT model ...
Improving Cystic Fibrosis Screening Through a Novel Testing Design
Newborn screening (NBS) plays a key role in detecting life-threatening genetic disorders, with cystic fibrosis (CF) being a prominent example. Having CF requires both copies of the...
Developing an Integrated Rural Tourism Model for Stakeholders in Yuanjia Village, China
This research aims to propose an Integrated Rural Tourism (IRT) development model for stakeholders in Yuanjia village, China. Although IRT has been widely discussed, research rarel...
Parsimonious item response theory modeling with different link functions
[EMBARGOED UNTIL 6/1/2023] Traditional item response theory (IRT) models assume a symmetric error distribution and rely on symmetric (logit or probit) link functions to model the r...
Accuracy of Icare Rebound Tonometer and Its Comparison with Goldman Applanation Tonometer
Purpose:  To determine accuracy of iCare rebound tonometer (IRT) in terms of agreement with Goldman Applanation Tonometer (GAT) and effect of Central corneal thickness (CCT) on its...
Sensitivity Of Differential Item Functioning Detection Methods On National Mathematics Examination In North Sumatera Province, Indonesia
The purpose of this study was to examine the differences in sensitivity of three methods: IRT-Likelihood Ratio (IRT-LR), Mantel-Haenszel (MH) and Logistics Regression (LR), in dete...
SOSIALISASI DROPSHIP SEBAGAI PELUANG BISNIS TANPA MODAL BAGI KADER PKK DAN IRT DI KELURAHAN GUNUNG BAHAGIA BALIKPAPAN
ABSTRACT Gunung Bahagia sub-district in Balikpapan is home to PKK cadres made up of housewives and teenagers, whose many activities have already demonstrated achievements that ...
