Abstract
An advanced approach to Bayesian classification is based on exploited patterns. However, traditional pattern-based Bayesian classifiers cannot adapt to the evolving data stream environment. For that, an effective Pattern-based Bayesian classifier for Data Stream (PBDS) is proposed. First, a data-driven lazy learning strategy is employed to discover local frequent patterns for each test record. Furthermore, we propose a summary data structure for compact representation of data, and to find patterns more efficiently for each class. Greedy search and minimum description length combined with Bayesian network are applied to evaluating extracted patterns. Experimental studies on real-world and synthetic data streams show that PBDS outperforms most state-of-the-art data stream classifiers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
\(P(y_i|\mathbf x ) = \frac{P(\mathbf x ,y_i)}{P(\mathbf x )}=\frac{P(y_i) \cdot P(\mathbf x |y_i)}{P(\mathbf x )}\).
- 2.
Datasets Chess, Connect-4, EEG, MAGIC, PokerHand and CoverType are downloaded from UCI Machine Learning Repository http://archive.ics.uci.edu/ml/; Others are generated by the classical data generators separately with 1,000,000 records via MOA.
References
Baralis, E., Cagliero, L., Garza, P.: Enbay: a novel pattern-based bayesian classifier. IEEE Trans. Knowl. Data Eng. 25(12), 2780–2795 (2013)
Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: MOA: massive online analysis. J. Mach. Learn. Res. 11(May), 1601–1604 (2010)
Bifet, A., Pfahringer, B., Read, J., Holmes, G.: Efficient data stream classification via probabilistic adaptive windows. In: Proceedings of the 28th Annual ACM Symposium on Applied Computing, pp. 801–806. ACM (2013)
Cheng, H., Yan, X., Han, J., Hsu, C.W.: Discriminative frequent pattern analysis for effective classification. In: 2007 IEEE 23rd International Conference on Data Engineering, pp. 716–725. IEEE (2007)
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Machine Learning, pp. 1022–1027 (1993)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Mach. Learn. 29(2–3), 131–163 (1997)
Gama, J., Kosina, P., et al.: Learning decision rules from data streams. In: IJCAI Proceedings-International Joint Conference on Artificial Intelligence, vol. 22, p. 1255 (2011)
Gomes, H.M., Barddal, J.P., Enembreck, F., Bifet, A.: A survey on ensemble learning for data stream classification. ACM Comput. Surv. (CSUR) 50(2), 23 (2017)
Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 97–106. ACM (2001)
Li, H.F., Shan, M.K., Lee, S.Y.: DSM-FI: an efficient algorithm for mining frequent itemsets in data streams. Knowl. Inf. Syst. 17(1), 79–97 (2008)
Meretakis, D., Wüthrich, B.: Extending Naive Bayes classifiers using long itemsets. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 165–174. ACM (1999)
Sun, Y., Wang, Z., Liu, H., Du, C., Yuan, J.: Online ensemble using adaptive windowing for data streams with concept drift. Int. J. Distrib. Sens. Netw. 12, 4218973 (2016)
Wang, H., Fan, W., Yu, P.S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235. ACM (2003)
Yuan, J., Wang, Z., Han, M., Sun, Y.: A lazy associative classifier for time series. Intell. Data Anal. 19(5), 983–1002 (2015)
Acknowledgments
This work is supported by National Natural Science Foundation of China (Nos. 61672086 and 61702030) and the Fundamental Research Funds for the Central Universities (Nos. 2016RC048 and 2016YJS036).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Yuan, J., Wang, Z., Sun, Y., Zhang, W., Jiang, J. (2017). A Pattern-Based Bayesian Classifier for Data Stream. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science(), vol 10637. Springer, Cham. https://doi.org/10.1007/978-3-319-70093-9_92
Download citation
DOI: https://doi.org/10.1007/978-3-319-70093-9_92
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70092-2
Online ISBN: 978-3-319-70093-9
eBook Packages: Computer ScienceComputer Science (R0)