Abstract
Many anomaly detection techniques consider all the data-space dimensions when looking for outliers, and some others consider only specific subspaces, in isolation from other subspaces. However, interesting information about anomalous data points is embedded in the inter-relationships of the subspaces within which the data points appear to be outliers. Important characteristics of a dataset can be revealed by looking at these inter-relationships among subspaces. We describe a methodology for searching for outliers within the context of contiguous subspaces in the subspace lattice of a domain. We demonstrate additional insights about the outliers gained from this approach compared to finding the outliers in only specific subspaces or in the complete data-space. This additional information points an analyst to peculiar sets of subspaces to investigate further the underlying structure of the data space and also of the anomalous nature of the data points.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Knorr, E.M., Ng, R.T.: Finding Intensional Knowledge of Distance-Based Outliers. In: Proceedings of 25th International Conference on Very Large Data Bases VLDB 1999, Edinburgh, Scotland, UK, September 7-10, pp. 211–222. Morgan Kaufmann (1999)
Keller, F., Muller, E.B.K.: HiCS: High Contrast Subspaces for Density-Based Outlier Ranking Data Engineering (ICDE). In: 2012 IEEE 28th International Conference on, pp. 1037–1048 (2012)
Zhang, J., Lou, M., Ling, T.W., Wang, H.: HOS-Miner: A System for Detecting Outlying Subspaces of High-dimensional Data. In: Proceedings 2004 VLDB Conference, pp. 1265–1268. Morgan Kaufmann (2004)
Aggarwal, C.C.: Outlier analysis. Springer (2013)
Breunig, M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. SIGMOD Rec. 29(2), 93–104 (2000), http://doi.acm.org/10.1145/335191.335388 , doi:10.1145/335191.335388
Angiulli, F., Pizzuti, C.: Outlier mining in large high-dimensional data sets. IEEE Transactions on Knowledge and Data Engineering 17, 203–215 (2005)
Nguyen, H.V., Gopalkrishnan, V., Assent, I.: An unbiased distance-based outlier detection approach for high-dimensional data. In: Yu, J.X., Kim, M.H., Unland, R. (eds.) DASFAA 2011, Part I. LNCS, vol. 6587, pp. 138–152. Springer, Heidelberg (2011)
Ebdon, D.: Statistics in geography. John Wiley and Sons (1985)
Bache, K., Lichman, M.: UCI Machine Learning Repository. University of California, School of Information and Computer Science, Irvine, CA (2013), http://archive.ics.uci.edu/ml
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Joshi, V., Bhatnagar, R. (2014). Outlier Analysis Using Lattice of Contiguous Subspaces. In: Ślȩzak, D., Schaefer, G., Vuong, S.T., Kim, YS. (eds) Active Media Technology. AMT 2014. Lecture Notes in Computer Science, vol 8610. Springer, Cham. https://doi.org/10.1007/978-3-319-09912-5_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-09912-5_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09911-8
Online ISBN: 978-3-319-09912-5
eBook Packages: Computer ScienceComputer Science (R0)