The power of outliers (and why researchers should ALWAYS check for them)
North Carolina State University
Indexed indatacite
Abstract
There has been much debate in the literature regarding what to do with extreme or influential data points. The goal of this paper is to summarize the various potential causes of extreme scores in a data set (e.g., data recording or entry errors, motivated mis-reporting, sampling errors, and legitimate sampling), how to detect them, and whether they should be removed or not. Another goal of this paper was to explore how significantly a small proportion of outliers can affect even simple analyses. The examples show a strong beneficial effect of removal of extreme scores. Accuracy tended to increase significantly and substantially, and errors of inference tended to drop significantly and substantially once…
Citation impact
878
total citations
- FWCI
- 53.78
- Percentile
- 100%
- References
- 27
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Outlier
- Statistic
- Nonparametric statistics
- Inference
- Statistics
- Econometrics
- Casual
- Computer science
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.