Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

High Agreement and High Prevalence: The Paradox of Cohen’s Kappa

View through CrossRef
Background: Cohen's Kappa is the most used agreement statistic in literature. However, under certain conditions, it is affected by a paradox which returns biased estimates of the statistic itself. Objective: The aim of the study is to provide sufficient information which allows the reader to make an informed choice of the correct agreement measure, by underlining some optimal properties of Gwet’s AC1 in comparison to Cohen’s Kappa, using a real data example. Method: During the process of literature review, we have asked a panel of three evaluators to come up with a judgment on the quality of 57 randomized controlled trials assigning a score to each trial using the Jadad scale. The quality was evaluated according to the following dimensions: adopted design, randomization unit, type of primary endpoint. With respect to each of the above described features, the agreement between the three evaluators has been calculated using Cohen’s Kappa statistic and Gwet’s AC1 statistic and, finally, the values have been compared with the observed agreement. Results: The values of the Cohen’s Kappa statistic would lead to believe that the agreement levels for the variables Unit, Design and Primary Endpoints are totally unsatisfactory. The AC1 statistic, on the contrary, shows plausible values which are in line with the respective values of the observed concordance. Conclusion: We conclude that it would always be appropriate to adopt the AC1 statistic, thus bypassing any risk of incurring the paradox and drawing wrong conclusions about the results of agreement analysis.
Title: High Agreement and High Prevalence: The Paradox of Cohen’s Kappa
Description:
Background: Cohen's Kappa is the most used agreement statistic in literature.
However, under certain conditions, it is affected by a paradox which returns biased estimates of the statistic itself.
Objective: The aim of the study is to provide sufficient information which allows the reader to make an informed choice of the correct agreement measure, by underlining some optimal properties of Gwet’s AC1 in comparison to Cohen’s Kappa, using a real data example.
Method: During the process of literature review, we have asked a panel of three evaluators to come up with a judgment on the quality of 57 randomized controlled trials assigning a score to each trial using the Jadad scale.
The quality was evaluated according to the following dimensions: adopted design, randomization unit, type of primary endpoint.
With respect to each of the above described features, the agreement between the three evaluators has been calculated using Cohen’s Kappa statistic and Gwet’s AC1 statistic and, finally, the values have been compared with the observed agreement.
Results: The values of the Cohen’s Kappa statistic would lead to believe that the agreement levels for the variables Unit, Design and Primary Endpoints are totally unsatisfactory.
The AC1 statistic, on the contrary, shows plausible values which are in line with the respective values of the observed concordance.
Conclusion: We conclude that it would always be appropriate to adopt the AC1 statistic, thus bypassing any risk of incurring the paradox and drawing wrong conclusions about the results of agreement analysis.

Related Results

Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study
Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study
Abstract Introduction The exact manner in which large language models (LLMs) will be integrated into pathology is not yet fully comprehended. This study examines the accuracy, bene...
North Syrian Mortaria and Other Late Roman Personal and Utility Objects Bearing Inscriptions of Good Luck
North Syrian Mortaria and Other Late Roman Personal and Utility Objects Bearing Inscriptions of Good Luck
<span style="font-size: 11pt; color: black; font-family: 'Times New Roman','serif'">&Pi;&Eta;&Lambda;&Iota;&Nu;&Alpha; &Iota;&Gamma;&Delta...
Un manoscritto equivocato del copista santo Theophilos († 1548)
Un manoscritto equivocato del copista santo Theophilos († 1548)
<p><font size="3"><span class="A1"><span style="font-family: 'Times New Roman','serif'">&Epsilon;&Nu;&Alpha; &Lambda;&Alpha;&Nu;&...
Restricted kappa chain expression in early ontogeny: biased utilization of V kappa exons and preferential V kappa-J kappa recombinations.
Restricted kappa chain expression in early ontogeny: biased utilization of V kappa exons and preferential V kappa-J kappa recombinations.
To determine the extent of kappa chain diversity in the preimmune repertoire early in development, kappa cDNA libraries were analyzed from 15-d old fetal omentum, 18-d-old fetal li...
AN ANALYSIS OF EXPECTANCY VALUES IN FRONT OF THE CLASS FILM
AN ANALYSIS OF EXPECTANCY VALUES IN FRONT OF THE CLASS FILM
ABSTRACTThis research intended to find the values of expectancy in Front of the Class film. The researcher analyzed the values of expectancy that depicted through Brad Cohen charac...
A Randomized Controlled Pilot Study using Mind–Body Interventions among Refugees in Sweden
A Randomized Controlled Pilot Study using Mind–Body Interventions among Refugees in Sweden
Background: Migration is one of the major challenges of the 21st century with many refugees being victims of torture and experiencing war and the collapse of their society. Sweden,...
Organizational Paradox
Organizational Paradox
Organizational paradox offers a theory of the nature and management of competing demands. Historically, the dominant paradigm in organizational theory depicted competing demands as...

Back to Top