Multiple Outliers in Small Samples

Rachel Traylor

Abstract

Z-scores are often used to detect outliers in a dataset. In small samples, the presence of multiple outliers imposes a finite supremum on the absolute value of attainable z-scores, and this supremum decreases as the number of outliers increases, creating a "masking effect" that hinders the identification of true outliers. We give an illustrative case study in which accurately detecting the number of outliers is critical, and we provide a closed-form expression for the maximum possible z-score in terms of the sample size and the number of outliers. A corresponding analysis of the t-statistic is also performed.
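The masking effect can be illustrated numerically with a minimal sketch. It assumes one particular configuration: k outliers sharing a single value, placed among n - k identical inliers, with z-scores computed from the sample standard deviation (n - 1 denominator). The function names and the helper `configuration_bound` are hypothetical, and the expression inside it was derived by hand for this configuration only; the abstract does not state the paper's closed form, so it should not be read as a quotation of it.

```python
import math
import statistics


def outlier_z_score(n: int, k: int, outlier_value: float = 100.0) -> float:
    """z-score of one outlier when k equal outliers sit among n - k identical inliers."""
    if not 1 <= k < n:
        raise ValueError("need 1 <= k < n")
    # Assumed configuration: inliers at 0, all k outliers at the same value.
    sample = [0.0] * (n - k) + [outlier_value] * k
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)
    return (outlier_value - mean) / sd


def configuration_bound(n: int, k: int) -> float:
    """Hand-derived value for this configuration only (an assumption, not the paper's formula)."""
    return math.sqrt((n - 1) * (n - k) / (n * k))


if __name__ == "__main__":
    n = 10
    # Columns: number of outliers, empirical z-score, hand-derived value.
    for k in range(1, 5):
        print(k, round(outlier_z_score(n, k), 3), round(configuration_bound(n, k), 3))
```

In this configuration the z-score does not depend on how extreme the outlier value is, only on n and k. With n = 10, even a single outlier stays below the conventional |z| > 3 cutoff, and each additional outlier lowers the attainable z-score further.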
