Appraising the Quality of Medical Education Research Methods

Cook, David A.; Reed, Darcy A.

doi:10.1097/acm.0000000000000786

articleAcademic MedicineJun 24, 2015Closed access

Appraising the Quality of Medical Education Research Methods

DADavid A. Cook DADarcy A. Reed

Mayo Clinic · Mayo Clinic in Arizona · +1 more institution

PubMed

Indexed incrossrefpubmed

Abstract

Results

Each instrument contains items concerning sampling, controlling for confounders, and integrity of outcomes. Interrater reliability for overall scores ranged from 0.68 to 0.95. Interrater reliability was "substantial" or better (ICC > 0.60) for nearly all domain-specific items on both instruments. Most instances of low interrater reliability were associated with restriction of range, and raw agreement was usually good. Across 26 studies evaluating published research, the median overall MERSQI score was 11.3 (range 8.9-15.1, of possible 18). Across six studies, the median overall NOS-E score was 3.22 (range 2.08-3.82, of possible 6). Overall MERSQI and NOS-E scores correlated reasonably well (rho 0.49-0.72).

Conclusions

The MERSQI and NOS-E are useful, reliable, complementary tools for appraising methodological quality of medical education research. Interpretation and use of their scores should focus on item-specific codes rather than overall scores. Normative scores should be used for relative rather than absolute judgments because different research questions require different study designs.

Citation impact

783

total citations

FWCI: 16.02
Percentile: 100%
References: 54

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Inter-rater reliability
Intraclass correlation
Normative
Reliability (semiconductor)
Psychology
Raw score
Scale (ratio)
Medicine

UN Sustainable Development Goals

Quality Education

No related works found for this paper.