Accuracy, Precision, And Agreement Statistical Tests For Bland-Altman Method
Abstract
Background: The Bland-Altman plot is a widely cited graphical approach for assessing the equivalence of quantitative measurement techniques. Perhaps because of its graphical output, it has been widely applied, but it is often misinterpreted for lack of inferential statistical support. To compare data sets obtained from two measurement techniques, researchers may apply Pearson’s correlation, ordinary least-squares linear regression, or the Bland-Altman plot, none of which locates the weakness of each measurement technique. We aim to develop and distribute a statistical method in R that adds robust and suitable inferential statistics of equivalence.
Methods: Three nested tests based on structural regressions are proposed to assess the equivalence of structural means (accuracy), the equivalence of structural variances (precision), and concordance with the structural bisector line (agreement between paired measurements obtained from the same subject), providing statistical support for the equivalence of measurement techniques. Graphical outputs illustrating these three tests were added, following Bland and Altman’s principle of easy communication.
Results: Statistical p-values and a robust bootstrap approach, with corresponding graphs, provide objective measures of equivalence. Five pairs of data sets from previously published articles that applied Bland and Altman’s principles were reanalyzed to critique those articles and to show the suitability of the present statistical approach. One case demonstrated strict equivalence, three cases showed partial equivalence, and one case showed poor equivalence. A package containing open-source code and data is freely available on SourceForge, with installation instructions.
Conclusions: Statistical p-values and the robust approach assess the equivalence of accuracy, precision, and agreement for measurement techniques. Decomposing the analysis into three tests helps locate any disagreement, which in turn helps in correcting a new technique.
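To make the quantities named in the abstract concrete, the following is a minimal Python sketch, not the authors' R package and not their test procedure. It computes the classical Bland-Altman bias and 95% limits of agreement, fits a Deming (structural) regression under the assumption that both techniques have equal error variance, and builds a percentile bootstrap interval by resampling subjects. All function names (`bland_altman`, `deming_fit`, `bootstrap_ci`) are hypothetical, and the equal-variance assumption is an illustrative choice.

```python
import random
import statistics


def bland_altman(a, b):
    """Return (bias, lower limit, upper limit) for paired measurements a, b."""
    diffs = [x - y for x, y in zip(a, b)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def deming_fit(a, b):
    """Deming regression of b on a; returns (intercept, slope).

    Assumption: the two techniques have equal error variances (ratio 1).
    """
    ma, mb = statistics.mean(a), statistics.mean(b)
    saa = sum((x - ma) ** 2 for x in a)
    sbb = sum((y - mb) ** 2 for y in b)
    sab = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    if sab == 0:
        raise ValueError("zero covariance: slope is undefined")
    slope = (sbb - saa + ((sbb - saa) ** 2 + 4 * sab ** 2) ** 0.5) / (2 * sab)
    return mb - slope * ma, slope


def bootstrap_ci(a, b, stat, n_boot=2000, alpha=0.05, seed=1):
    """Percentile bootstrap interval for stat(a, b), resampling subjects."""
    rng = random.Random(seed)
    n = len(a)
    values = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        values.append(stat([a[i] for i in idx], [b[i] for i in idx]))
    values.sort()
    lower = values[int(n_boot * alpha / 2)]
    upper = values[int(n_boot * (1 - alpha / 2)) - 1]
    return lower, upper
```

In this sketch, a bias interval that contains 0 is consistent with equal means, and a Deming line close to intercept 0 and slope 1 is consistent with the bisector line. For example, `bootstrap_ci(a, b, lambda x, y: bland_altman(x, y)[0])` gives an interval for the bias of two paired lists `a` and `b`.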
Research Square Platform LLC


