Posts about Outliers

Yet another robust outlier detector

Outlier detection is an important step in data processing. Unfortunately, if the distribution is not normal (e.g., right-skewed and heavy-tailed), it's hard to choose a robust outlier detection algorithm that will not be affected by tricky distribution properties. During the last several years, I tried many different approaches, but I was not satisfied with their results. Finally, I found an algorithm to which I have (almost) no complaints. It's based on the double median absolute deviation and the Harrell-Davis quantile estimator. In this post, I will show how it works and why it's better than some other approaches.

Read more