Abstract
In high-dimensional data, one often seeks a few interesting low-dimensional projections which reveal important aspects of the data. Projection pursuit for classification finds projections that reveal differences between classes. Even though projection pursuit is used to bypass the curse of dimensionality, most indexes will not work well when there are a small number of observations relative to the number of variables, known as a large p (dimension) small n (sample size) problem. This paper discusses the relationship between the sample size and dimensionality on classification and proposes a new projection pursuit index that overcomes the problem of small sample size for exploratory classification.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Dudoit, S., Fridlyand, J., Speed, T.P.: Comparison of discrimination methods for the classification of tumors using gene expression data. J. Am. Stat. Assoc. 97(1), 77–87 (2002)
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 268, 531–537 (1999)
Good, P.: Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses. Springer, Berlin (2000)
Hastie, T., Buja, A., Tibshirani, R.: Penalized discriminant analysis. Ann. Stat. 23(1), 73–102 (1995)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, Berlin (2001)
Huber, P.J.: Projection pursuit (with discussion). Ann. Stat. 13, 435–525 (1985)
Huber, P.J.: Data Analysis and Projection Pursuit. Technical Report PJH-90-1, MIT (1990)
Ihaka, R., Gentleman, R.: A language for data analysis and graphics. J. Comput. Graph. Stat. 5(3), 299–314 (1996)
Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 4th edn. Prentice-Hall, New Jersey (1998)
Lee, E., Cook, D., Klinke, S., Lumley, T.: Projection pursuit for exploratory supervised classification. J. Comput. Graph. Stat. 14(4), 831–846 (2005)
Ligges, U.: tuneR: Analysis of music. http://www.r-project.org
Marron, J.S., Todd, M.: Distance weighted discrimination. Optimization Online Digest, July (2002)
Ripley, B.D.: Pattern Recognition and Neural Networks. Cambridge University Press, Cambridge (1996)
Swayne, D.F., Lang, D.T., Buja, A., Cook, D.: GGobi: evolving from XGobi into an extensible framework for interactive data visualization. Comput. Stat. Data Anal. 43(4), 423–444 (2003)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, EK., Cook, D. A projection pursuit index for large p small n data. Stat Comput 20, 381–392 (2010). https://doi.org/10.1007/s11222-009-9131-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11222-009-9131-1