Statistical data mining

David L. Banks

doi:10.1002/wics.53

journal article Dec 31, 2009

Statistical data mining

David L. Banks

WIREs Computational Statistics Vol. 2 No. 1 pp. 9-25 · Wiley

View at Publisher Save 10.1002/wics.53

Abstract

AbstractData mining is widely used in modern science to extract signal from complex data sets. This article summarizes some of the key intellectual issues in the development of this field, largely from a historical perspective. There is particular emphasis on the Curse of Dimensionality, and its implications for non‐parametric regression, classification, and cluster analysis. Copyright © 2009 John Wiley & Sons, Inc.This article is categorized under:Statistical Learning and Exploratory Methods of the Data Sciences > Clustering and Classification

Topics

No keywords indexed for this article. Browse by subject →

References

34

[1]

The Elements of Statistical Learning

Trevor Hastie, Jerome Friedman, Robert Tibshirani

Springer Series in Statistics 10.1007/978-0-387-21606-5

[2]

Mitchell T (1997)

[3]

Bishop C (2008)

[4]

10.1515/9781400874668

[5]

Robust Locally Weighted Regression and Smoothing Scatterplots

William S. Cleveland

Journal of the American Statistical Association 10.1080/01621459.1979.10481038

[6]

10.1081/sac-120017506

[7]

Hastie T (1990)

[8]

Generalized Linear Models

P. McCullagh, J. A. Nelder

10.1007/978-1-4899-3242-6

[9]

Projection Pursuit Regression

Jerome H. Friedman, Werner Stuetzle

Journal of the American Statistical Association 10.1080/01621459.1981.10477729

[10]

10.1214/aos/1176347974

[11]

10.1109/18.256500

[12]

Breiman L (1984)

[13]

Multivariate Adaptive Regression Splines

Jerome H. Friedman

The Annals of Statistics 10.1214/aos/1176347963

[14]

Regression Shrinkage and Selection Via the Lasso

Robert Tibshirani

Journal of the Royal Statistical Society Series B:... 1996 10.1111/j.2517-6161.1996.tb02080.x

[15]

10.1111/j.1469-1809.1936.tb02137.x

[16]

10.1016/s0047-259x(02)00021-0

[17]

Vapnik V (1996)

[18]

10.1007/bf00994018

[19]

A training algorithm for optimal margin classifiers

Bernhard E. Boser, Isabelle M. Guyon, Vladimir N. Vapnik

Proceedings of the fifth annual workshop on Comput... 10.1145/130385.130401

[20]

Random Forests

Leo Breiman

Machine Learning 10.1023/a:1010933404324

[21]

The strength of weak learnability

Robert E. Schapire

Machine Learning 10.1007/bf00116037

[22]

Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors)

Jerome Friedman, Trevor Hastie, Robert Tibshirani

The Annals of Statistics 10.1214/aos/1016218223

[23]

Sibson R (1971)

[24]

10.1007/bf02294245

[25]

10.1093/biomet/58.1.91

[26]

10.1002/j.1538-7305.1957.tb01515.x

[27]

10.1111/j.1467-9868.2004.02059.x

[28]

10.1007/bf01898350

[29]

MacqueenJB. Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability University of California Press 1967;281–297.

[30]

10.1007/978-3-642-88163-3

[31]

10.1214/cbms/1462106013

[32]

Maximum Likelihood from Incomplete Data Via the EM Algorithm

A. P. Dempster, N. M. Laird, D. B. Rubin

Journal of the Royal Statistical Society Series B:... 1977 10.1111/j.2517-6161.1977.tb01600.x

[33]

10.2307/2532201

[34]

10.1073/pnas.0502269102

Metrics

8

Citations

34

References

Details

Published: Dec 31, 2009
Vol/Issue: 2(1)
Pages: 9-25
License: View

Authors

D

David L. Banks

Cite This Article

David L. Banks (2009). Statistical data mining. WIREs Computational Statistics, 2(1), 9-25. https://doi.org/10.1002/wics.53

Statistical data mining

You May Also Like