A Pattern-Based Bayesian Classifier for Data Stream | SpringerLink
Skip to main content

A Pattern-Based Bayesian Classifier for Data Stream

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2017)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10637))

Included in the following conference series:

  • 4280 Accesses

Abstract

An advanced approach to Bayesian classification is based on exploited patterns. However, traditional pattern-based Bayesian classifiers cannot adapt to the evolving data stream environment. For that, an effective Pattern-based Bayesian classifier for Data Stream (PBDS) is proposed. First, a data-driven lazy learning strategy is employed to discover local frequent patterns for each test record. Furthermore, we propose a summary data structure for compact representation of data, and to find patterns more efficiently for each class. Greedy search and minimum description length combined with Bayesian network are applied to evaluating extracted patterns. Experimental studies on real-world and synthetic data streams show that PBDS outperforms most state-of-the-art data stream classifiers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 11439
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 14299
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    \(P(y_i|\mathbf x ) = \frac{P(\mathbf x ,y_i)}{P(\mathbf x )}=\frac{P(y_i) \cdot P(\mathbf x |y_i)}{P(\mathbf x )}\).

  2. 2.

    Datasets Chess, Connect-4, EEG, MAGIC, PokerHand and CoverType are downloaded from UCI Machine Learning Repository http://archive.ics.uci.edu/ml/; Others are generated by the classical data generators separately with 1,000,000 records via MOA.

References

  1. Baralis, E., Cagliero, L., Garza, P.: Enbay: a novel pattern-based bayesian classifier. IEEE Trans. Knowl. Data Eng. 25(12), 2780–2795 (2013)

    Article  Google Scholar 

  2. Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: MOA: massive online analysis. J. Mach. Learn. Res. 11(May), 1601–1604 (2010)

    Google Scholar 

  3. Bifet, A., Pfahringer, B., Read, J., Holmes, G.: Efficient data stream classification via probabilistic adaptive windows. In: Proceedings of the 28th Annual ACM Symposium on Applied Computing, pp. 801–806. ACM (2013)

    Google Scholar 

  4. Cheng, H., Yan, X., Han, J., Hsu, C.W.: Discriminative frequent pattern analysis for effective classification. In: 2007 IEEE 23rd International Conference on Data Engineering, pp. 716–725. IEEE (2007)

    Google Scholar 

  5. Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Machine Learning, pp. 1022–1027 (1993)

    Google Scholar 

  6. Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Mach. Learn. 29(2–3), 131–163 (1997)

    Article  MATH  Google Scholar 

  7. Gama, J., Kosina, P., et al.: Learning decision rules from data streams. In: IJCAI Proceedings-International Joint Conference on Artificial Intelligence, vol. 22, p. 1255 (2011)

    Google Scholar 

  8. Gomes, H.M., Barddal, J.P., Enembreck, F., Bifet, A.: A survey on ensemble learning for data stream classification. ACM Comput. Surv. (CSUR) 50(2), 23 (2017)

    Article  Google Scholar 

  9. Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 97–106. ACM (2001)

    Google Scholar 

  10. Li, H.F., Shan, M.K., Lee, S.Y.: DSM-FI: an efficient algorithm for mining frequent itemsets in data streams. Knowl. Inf. Syst. 17(1), 79–97 (2008)

    Article  Google Scholar 

  11. Meretakis, D., Wüthrich, B.: Extending Naive Bayes classifiers using long itemsets. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 165–174. ACM (1999)

    Google Scholar 

  12. Sun, Y., Wang, Z., Liu, H., Du, C., Yuan, J.: Online ensemble using adaptive windowing for data streams with concept drift. Int. J. Distrib. Sens. Netw. 12, 4218973 (2016)

    Article  Google Scholar 

  13. Wang, H., Fan, W., Yu, P.S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235. ACM (2003)

    Google Scholar 

  14. Yuan, J., Wang, Z., Han, M., Sun, Y.: A lazy associative classifier for time series. Intell. Data Anal. 19(5), 983–1002 (2015)

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported by National Natural Science Foundation of China (Nos. 61672086 and 61702030) and the Fundamental Research Funds for the Central Universities (Nos. 2016RC048 and 2016YJS036).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jidong Yuan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Yuan, J., Wang, Z., Sun, Y., Zhang, W., Jiang, J. (2017). A Pattern-Based Bayesian Classifier for Data Stream. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science(), vol 10637. Springer, Cham. https://doi.org/10.1007/978-3-319-70093-9_92

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-70093-9_92

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70092-2

  • Online ISBN: 978-3-319-70093-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics