Exploratory data analysis (EDA) is a term first utilized by John Tukey (1977), and is intended to contrast with the more traditional statistical approach to data analysis that starts with hypothesis testing and model building. Instead of using confirmatory data analysis (CDA) methods to verify or refute suspected hypotheses, Tukey (1977) advocated the use of basic descriptive statistics and visualization methods to generate information that would lead to the development of hypotheses to test. The objectives of EDA (see Velleman and Hoaglin 1981) are therefore to provide statistical summaries of the data as a pre-processing step before building a model or testing hypotheses. The EDA phase enables new hypotheses to be suggested about the causes of the relationships in the data, and provides an opportunity to determine whether the assumption that various models rely upon are valid for the particular dataset. It provides us with a very rapid feel for the data: the shape of its...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References and Further Reading
Borg I, Groenen PJF (2005) Modern multidimensional scaling: theory and applications. Springer-Verlag, New York
Deboeck GJ, Kohonen TK (1998) Visual explorations in finance. Springer-Verlag, New York
Emerson JD, Hoaglin DC (1983) Analysis of two-way tables by medians. In: Hoaglin DC, Mosteller F, Tukey JW (eds) Understanding robust and exploratory data analysis Wiley, New York, pp 165–210
Hair JF (2006) Multivariate data analysis, Prentice-Hall, New Jersey
Kohonen T (1989) Self-organization and associative memory. Springer-Verlag, Berlin
Maimon OZ, Rokash L (2005) Data mining and knowledge discovery handbook, Springer, New York
Pyle D (1999) Data preparation for data mining, Morgan Kaufmann, San Francisco
Tukey JW (1977) Exploratory data analysis. Addison-Wesley, Reading, MA
Velleman PF, Hoaglin DC (1981) The ABC’s of EDA. Duxbury Press, Boston, MA
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this entry
Cite this entry
Smith-Miles, K. (2011). Exploratory Data Analysis. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_242
Download citation
DOI: https://doi.org/10.1007/978-3-642-04898-2_242
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04897-5
Online ISBN: 978-3-642-04898-2
eBook Packages: Mathematics and StatisticsReference Module Computer Science and Engineering