Javascript must be enabled to continue!

On the Statistical and Practical Limitations of Thurstonian IRT Models

Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been proposed. For convenient model specification, we introduce the thurstonianIRT R package, which uses Mplus, lavaan, and Stan for model estimation. Based on practical considerations, we establish that items within one block need to be equally keyed to achieve similar social desirability, which is essential for creating force-choice questionnaires that have the potential to resist faking intentions. According to extensive simulations, measuring up to 5 traits using blocks of only equally keyed items does not yield sufficiently accurate trait scores and inter-trait correlation estimates, neither for frequentist nor Bayesian estimation methods. As a result, persons' trait scores remain partially ipsative and, thus, do not allow for valid comparisons between persons. However, we demonstrate that trait scores based on only equally keyed blocks can be improved substantially by measuring a sizeable number of traits. More specifically, in our simulations of 30 traits, scores based on only equally keyed blocks were non-ipsative and highly accurate. We conclude that in high-stakes situations where persons are motivated to give fake answers, Thurstonian IRT models should only be applied to tests measuring a sizeable number of traits.

Center for Open Science

Paul - Christian Bürkner Niklas Schulte Heinz Holling

2018

Title: On the Statistical and Practical Limitations of Thurstonian IRT Models

Description:

Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires.

To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been proposed.

For convenient model specification, we introduce the thurstonianIRT R package, which uses Mplus, lavaan, and Stan for model estimation.

Based on practical considerations, we establish that items within one block need to be equally keyed to achieve similar social desirability, which is essential for creating force-choice questionnaires that have the potential to resist faking intentions.

According to extensive simulations, measuring up to 5 traits using blocks of only equally keyed items does not yield sufficiently accurate trait scores and inter-trait correlation estimates, neither for frequentist nor Bayesian estimation methods.

As a result, persons' trait scores remain partially ipsative and, thus, do not allow for valid comparisons between persons.

However, we demonstrate that trait scores based on only equally keyed blocks can be improved substantially by measuring a sizeable number of traits.

More specifically, in our simulations of 30 traits, scores based on only equally keyed blocks were non-ipsative and highly accurate.

We conclude that in high-stakes situations where persons are motivated to give fake answers, Thurstonian IRT models should only be applied to tests measuring a sizeable number of traits.

Back

The purpose of this study is to investigate the most appropriate alternative IRT parameter estimation models among bi-factor model, testlet based model, and second-order IRT model ...

Improving Cystic Fibrosis Screening Through a Novel Testing Design

Newborn screening (NBS) plays a key role in detecting life-threatening genetic disorders, with cystic fibrosis (CF) being a prominent example. Having CF requires both copies of the...

Developing an Integrated Rural Tourism Model for Stakeholders in Yuanjia Village, China

This research aims to propose an Integrated Rural Tourism (IRT) development model for stakeholders in Yuanjia village, China. Although IRT has been widely discussed, research rarel...

Parsimonious item response theory modeling with different link functions

[EMBARGOED UNTIL 6/1/2023] Traditional item response theory (IRT) models assume a symmetric error distribution and rely on symmetric (logit or probit) link functions to model the r...

Accuracy of Icare Rebound Tonometer and Its Comparison with Goldman Applanation Tonometer

Purpose: To determine accuracy of iCare rebound tonometer (IRT) in terms of agreement with Goldman Applanation Tonometer (GAT) and effect of Central corneal thickness (CCT) on its...

Investigating the Perceptions and Attitudes of ESL Learners Towards theUse of Immersive Reader Technology in Enhancing Reading Comprehension at the Secondary School Level

Introduction: This article delves into the utilization of Immersive Reader Technology (IRT) as a tool to enhance reading comprehension among ESL students in secondary schools. It e...

Sensitivity Of Differential Item Functioning Detection Methods On National Mathematics Examination In North Sumatera Province, Indonesia

The purpose of this study was to examine the differences in sensitivity of three methods: IRT-Likelihood Ratio (IRT-LR), Mantel-Haenszel (MH) and Logistics Regression (LR), in dete...

SOSIALISASI DROPSHIP SEBAGAI PELUANG BISNIS TANPA MODAL BAGI KADER PKK DAN IRT DI KELURAHAN GUNUNG BAHAGIA BALIKPAPAN

ABSTRAK Kelurahan Gunung Bahagia Balikpapan menaungi kader PKK yang beranggotakan ibu rumah dan remaja dimana sudah banyak kegiatan yang dilakukan mampu menunjukkan prestasi yang ...

Email:
Password:

Email:

On the Statistical and Practical Limitations of Thurstonian IRT Models

Related Results