journal article Open Access Nov 23, 2017

Anomaly detection by robust statistics

View at Publisher Save 10.1002/widm.1236
Abstract
Real data often contain anomalous cases, also known as outliers. These may spoil the resulting analysis but they may also contain valuable information. In either case, the ability to detect such anomalies is essential. A useful tool for this purpose is robust statistics, which aims to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. We present an overview of several robust methods and the resulting graphical outlier detection tools. We discuss robust procedures for univariate, low‐dimensional, and high‐dimensional data, such as estimating location and scatter, linear regression, principal component analysis, classification, clustering, and functional data analysis. Also the challenging new topic of cellwise outliers is introduced.WIREs Data Mining Knowl Discov2018, 8:e1236. doi: 10.1002/widm.1236This article is categorized under:Algorithmic Development > Spatial and Temporal Data MiningTechnologies > ClassificationTechnologies > Structure Discovery and ClusteringTechnologies > Visualization
Topics

No keywords indexed for this article. Browse by subject →

References
78
[5]
Donoho DL (1983)
[7]
Hampel FR (1986)
[9]
Robust Estimation of a Location Parameter

Peter J. Huber

The Annals of Mathematical Statistics 10.1214/aoms/1177703732
[14]
Hubert M "Minimum covariance determinant and extensions" Wiley Interdiscip Rev Comput Stat (2017)
[17]
BoudtK RousseeuwPJ VanduffelS VerdonckT. The minimum regularized covariance determinant estimator. Technical Report arXiv:1701.07086 2017. 10.2139/ssrn.2905259
[18]
Stahel WA. Robuste Schätzungen: infinitesimale Optimalität und Schätzungen von Kovarianzmatrizen. PhD Thesis ETH Zürich 1981.
[19]
Donoho DL. Breakdown properties of multivariate location estimators. PhD Qualifying paper Department of Statistics Harvard University Boston 1982.
[21]
Rousseeuw PJ "A measure of directional outlyingness with applications to image data and video" J Comput Graph Stat (2017)
[24]
Tatsuoka KS "On the uniqueness of S‐functionals and M‐functionals under nonelliptical distributions" Ann Stat (2000) 10.1214/aos/1015956714
[31]
Brownlee KA (1965)
[35]
Robust Statistics

Peter J. Huber

Wiley Series in Probability and Statistics 10.1002/0471725250
[48]
Steinwart I (2008) 10.1007/978-0-387-77242-4

Showing 50 of 78 references

Cited By
193
Anomaly Detection on Data Streams for Smart Agriculture

Juliet Chebet Moso, Stéphane Cormier · 2021

Agriculture
A Robust Bootstrap Test for Mediation Analysis

Andreas Alfons, Nüfer Yasin Ateş · 2021

Organizational Research Methods
Mathematics
Metrics
193
Citations
78
References
Details
Published
Nov 23, 2017
Vol/Issue
8(2)
License
View
Cite This Article
Peter J. Rousseeuw, Mia Hubert (2017). Anomaly detection by robust statistics. WIREs Data Mining and Knowledge Discovery, 8(2). https://doi.org/10.1002/widm.1236
Related

You May Also Like

Ensemble learning: A survey

Omer Sagi, Lior Rokach · 2018

2,146 citations

Classification and regression trees

Wei‐Yin Loh · 2011

1,781 citations

Hyperparameters and tuning strategies for random forest

Philipp Probst, Marvin N. Wright · 2019

1,369 citations

Educational data mining and learning analytics: An updated survey

Cristobal Romero, Sebastián Ventura · 2020

663 citations

Bias in data‐driven artificial intelligence systems—An introductory survey

Eirini Ntoutsi, Pavlos Fafalios · 2020

652 citations