Javascript must be enabled to continue!
Does Reproducibility Drive Clinical Accuracy?
View through CrossRef
Abstract
Objectives
To develop a stochastic model relating measurement uncertainty, including reproducibility, to clinical accuracy, as demonstrated by the receiver operating characteristic curve.
Methods
A model is developed based on the symmetric case of the well-known binormal distribution. The overall distribution is partitioned further into analytical and biological components based on assumptions derived from the Cotlove criterion. Explicit mathematical solutions are derived and further verified by Monte Carlo analyses.
Results
The model demonstrates that tests with analytical error that conforms to the classic Cotlove criterion can achieve receiver operating characteristic curves with areas under the curve of 0.68 to 0.76 and Youden indices of 0.26 to 0.38 but have overall agreement for duplicate measurements of only 80% to 82%. Furthermore, the analytically accurate agreement is only 75% to 78%, and the clinically accurate agreement is only 50% to 60%.
Conclusions
The model suggests that assays may have reasonable clinical accuracy despite having reproducibility of less than 85%. Imperfect assays can substantially improve medical decision-making. The findings must be interpreted with caution given the binormal assumptions, but such assumptions are often useful as a first approximation. Practicing pathologists should feel comfortable performing semiquantitative assays shown to have a strong biological association with clinical outcome.
Title: Does Reproducibility Drive Clinical Accuracy?
Description:
Abstract
Objectives
To develop a stochastic model relating measurement uncertainty, including reproducibility, to clinical accuracy, as demonstrated by the receiver operating characteristic curve.
Methods
A model is developed based on the symmetric case of the well-known binormal distribution.
The overall distribution is partitioned further into analytical and biological components based on assumptions derived from the Cotlove criterion.
Explicit mathematical solutions are derived and further verified by Monte Carlo analyses.
Results
The model demonstrates that tests with analytical error that conforms to the classic Cotlove criterion can achieve receiver operating characteristic curves with areas under the curve of 0.
68 to 0.
76 and Youden indices of 0.
26 to 0.
38 but have overall agreement for duplicate measurements of only 80% to 82%.
Furthermore, the analytically accurate agreement is only 75% to 78%, and the clinically accurate agreement is only 50% to 60%.
Conclusions
The model suggests that assays may have reasonable clinical accuracy despite having reproducibility of less than 85%.
Imperfect assays can substantially improve medical decision-making.
The findings must be interpreted with caution given the binormal assumptions, but such assumptions are often useful as a first approximation.
Practicing pathologists should feel comfortable performing semiquantitative assays shown to have a strong biological association with clinical outcome.
Related Results
METHODOLOGY OF THE DESIGN CALCULATION OF THE ELECTRO-HYDRAULIC SERVO DRIVE OF TECHNOLOGICAL EQUIPMENT
METHODOLOGY OF THE DESIGN CALCULATION OF THE ELECTRO-HYDRAULIC SERVO DRIVE OF TECHNOLOGICAL EQUIPMENT
Machine-building industries and enterprises for modernization of railway rolling stock are constantly increasing the requirements for the technical and functional char...
Epidemiological characteristics and prevalence rates of research reproducibility across disciplines: A scoping review of articles published in 2018-2019
Epidemiological characteristics and prevalence rates of research reproducibility across disciplines: A scoping review of articles published in 2018-2019
Background:Reproducibility is a central tenant of research. We aimed to synthesize the literature on reproducibility and describe its epidemiological characteristics, including how...
Assessment of transparent and reproducible research practices in the psychiatry literature
Assessment of transparent and reproducible research practices in the psychiatry literature
Background
Reproducibility is a cornerstone of scientific advancement; however, many published works may lack the core components needed for study reproducibility...
Interobserver and Intraobserver Reproducibility and Reliability of the Huashan Clinical Classification System for Hirayama Disease
Interobserver and Intraobserver Reproducibility and Reliability of the Huashan Clinical Classification System for Hirayama Disease
Purpose: The Huashan clinical classification system for Hirayama disease has recently been proposed and has been found useful for diagnosis and treatment. So far, however, there ha...
BioModels Reproducibility Scorecard
BioModels Reproducibility Scorecard
Systems biology modelling involves the mathematical representation of biological processes to study complex system behaviour and was expected to be least affected by the reproducib...
Assessment of transparent and reproducible research practices in the psychiatry literature
Assessment of transparent and reproducible research practices in the psychiatry literature
Objective: Reproducibility is a cornerstone of scientific advancement; however, many published works may lack the core components needed for study reproducibility. In this study, w...
A retrospective evaluation of reliability and reproducibility of Arbeitsgemeinschaftfür Osteosynthesefragen classification and Fernandez classification for distal radius fracture
A retrospective evaluation of reliability and reproducibility of Arbeitsgemeinschaftfür Osteosynthesefragen classification and Fernandez classification for distal radius fracture
Abstract
This is a retrospective evaluated. The objective of this study was to test the interobserver reliability and intraobserver reproducibility of fracture classifi...
Testing inter-scanner reproducibility of CrCEST at 3T
Testing inter-scanner reproducibility of CrCEST at 3T
Motivation: CrCEST MRI can map muscle-specific oxidative phosphorylation (OXPHOS), yet its reproducibility across different scanners has not been tested. Goal(s): To evaluate the i...

