Measurement of Observer Agreement

Kundel, Harold L.; Polansky, Marcia

doi:10.1148/radiol.2282011860

Article · Radiology · Aug 1, 2003 · Closed access

University of Pennsylvania

Indexed in: Crossref, PubMed

Abstract

Statistical measures are described that are used in diagnostic imaging for expressing observer agreement in regard to categorical data. The measures are used to characterize the reliability of imaging methods and the reproducibility of disease classifications and, occasionally with great care, as the surrogate for accuracy. The review concentrates on the chance-corrected indices, kappa and weighted kappa. Examples from the imaging literature illustrate the method of calculation and the effects of both disease prevalence and the number of rating categories. Other measures of agreement that are used less frequently, including multiple-rater kappa, are referenced and described briefly.
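The abstract does not reproduce the calculation itself, so the following is only a minimal sketch of how kappa and weighted kappa are commonly computed from a two-reader contingency table. The function names (`cohen_kappa`, `linear_weights`), the linear weighting scheme, and both example tables are hypothetical illustrations, not data or notation taken from the article.

```python
def cohen_kappa(table, weights=None):
    """Chance-corrected agreement between two readers.

    table[i][j] is the number of cases placed in category i by reader A
    and in category j by reader B. With weights=None this is unweighted
    kappa; otherwise weights[i][j] in [0, 1] gives partial credit.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(k)) for j in range(k)]
    if weights is None:
        # Identity weights: credit only for exact agreement.
        weights = [[1.0 if i == j else 0.0 for j in range(k)] for i in range(k)]
    # Observed (weighted) agreement.
    p_o = sum(weights[i][j] * table[i][j]
              for i in range(k) for j in range(k)) / n
    # Agreement expected by chance from the marginal totals.
    p_e = sum(weights[i][j] * row_tot[i] * col_tot[j]
              for i in range(k) for j in range(k)) / n ** 2
    return (p_o - p_e) / (1 - p_e)


def linear_weights(k):
    """Assumed linear weights: credit falls off with category distance."""
    return [[1 - abs(i - j) / (k - 1) for j in range(k)] for i in range(k)]


# Hypothetical balanced table: 85% raw agreement.
balanced = [[40, 10],
            [5, 45]]
# Hypothetical low-prevalence table: 92% raw agreement.
skewed = [[90, 4],
          [4, 2]]

print(cohen_kappa(balanced))
print(cohen_kappa(skewed))
```

In these made-up tables the balanced case gives a kappa of 0.7, while the skewed case, despite higher raw agreement, gives a kappa of about 0.29. That is the kind of prevalence effect the abstract refers to. For ordered categories, `cohen_kappa(table, linear_weights(len(table)))` gives a weighted variant under the assumed linear scheme.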

Citation impact

Total citations: 1,500

FWCI: 52.47
Percentile: 100%
References: 28

Topics & keywords

Keywords

Kappa
Medicine
Categorical variable
Cohen's kappa
Reliability (semiconductor)
Reproducibility
Inter-rater reliability
Statistics
