We live in a dynamic world, where changes are a part of everyday life. When there is a shift in data, the classification or prediction models need to be adaptive to the changes. In data mining the phenomenon of change in data distribution over time is known as concept drift. In this research, the authors propose an adaptive supervised learning with delayed labeling methodology. As a part of this methodology, the atuhors introduce Adaptive Training Set Formation for Delayed Labeling Algorithm (SFDL), which is based on selective training set formation. Our proposed solution is considered as the first systematic training set formation approach which takes into account delayed labeling problem. It can be used with any base classifier without the need to change the implementation or setting of this classifier. The authors test their algorithm implementation using synthetic and real dataset from various domains which might have different drift types (sudden, gradual, incremental recurrences) with different speed of change. The experimental results confirm improvement in classification accuracy as compared to ordinary classifier for all drift types. The authors\u2019 approach is able to increase the classifications accuracy with 20% in average and 56% in the best cases of our experimentations and it has not been worse than the ordinary classifiers in any case. Finally a comparison with other four related methods to deal with changing in user interest over time and handle recurrence drift is performed. These methods are simple incremental method, time window approach with different window size, instance weighting method and conceptual clustering and prediction framework (CCP). Results indicate the effectiveness of the proposed method over other methods in terms of classification accuracy.<\/p>","DOI":"10.4018\/jtd.2013010103","type":"journal-article","created":{"date-parts":[[2013,10,7]],"date-time":"2013-10-07T19:54:02Z","timestamp":1381175642000},"page":"33-55","source":"Crossref","is-referenced-by-count":2,"title":["Learning Concept Drift Using Adaptive Training Set Formation Strategy"],"prefix":"10.4018","volume":"4","author":[{"given":"Nabil M.","family":"Hewahi","sequence":"first","affiliation":[{"name":"Computer Science Department, Faculty of Information Technology, Islamic University of Gaza, Gaza, Palestine"}]},{"given":"Sarah N.","family":"Kohail","sequence":"additional","affiliation":[{"name":"Computer Science Department, Faculty of Information Technology, Islamic University of Gaza, Gaza, Palestine"}]}],"member":"2432","reference":[{"key":"jtd.2013010103-0","unstructured":"Brzezinski, D. (2010). Mining data streams with concept drift. Master thesis, Poznan University of Technology."},{"key":"jtd.2013010103-1","first-page":"18","article-title":"A case-based technique for tracking concept drift in spam filtering.","author":"S. J.Delany","year":"2005","journal-title":"Knowledge-Based Systems"},{"key":"jtd.2013010103-2","unstructured":"Harries, M. (1999). Splice-2 comparative evaluation: Electricity pricing. Technical report, The University of South Wales. Retrieved October 8, 2011, from http:\/\/www.liaad.up.pt\/~jgama\/ales\/ales_5.html"},{"key":"jtd.2013010103-3","doi-asserted-by":"publisher","DOI":"10.1007\/s10844-008-0053-8"},{"key":"jtd.2013010103-4","unstructured":"Katakis, I., Tsoumakas, G., & Vlahavas, I. (2008). An ensemble of classifiers for coping with recurring contexts in data streams. In Proceeding of 18th European Conference on Artificial Intelligence, Patras, Greece."},{"key":"jtd.2013010103-5","doi-asserted-by":"crossref","unstructured":"Kelly, M., Hand, D., & Adams, N. (1999, August 15-18). The impact of changing populations on classifier performance. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 367-371).","DOI":"10.1145\/312129.312285"},{"issue":"3","key":"jtd.2013010103-6","doi-asserted-by":"crossref","first-page":"281","DOI":"10.3233\/IDA-2004-8305","article-title":"Learning drifting concepts: Example selection vs. example weighting.","volume":"8","author":"R.Klinkenberg","year":"2004","journal-title":"Intelligent Data Analysis"},{"key":"jtd.2013010103-7","doi-asserted-by":"crossref","unstructured":"Ludmila, I., Kuncheva, J., & Salvador, S. (2008). Nearest neighbour classifiers for streaming data with delayed labeling. In Proceedings of the Eighth IEEE International Conference on Data Mining ICDM (pp.869-874).","DOI":"10.1109\/ICDM.2008.33"},{"key":"jtd.2013010103-8","unstructured":"Narasimhamurthy, A., & Kuncheva, L. (2007). A framework for generating data to simulate changing environments. In Proceeding of the 25th IASTED int. Multi-Conference, Artificial Intelligence and Applications (AIAP\u201f07) (pp. 384\u2013389). ACTA Press."},{"key":"jtd.2013010103-9","unstructured":"Nishida, K. (2008). Learning and detecting concept drift. PhD thesis, Hokkaido University, Japan, 2008."},{"key":"jtd.2013010103-10","doi-asserted-by":"crossref","unstructured":"Nishida, K., & Yamauchi, K. (2009). Learning, detecting, understanding, and predicting concept changes. International Joint Conference on Neural Networks (IJCNN) (pp. 2280-2287).","DOI":"10.1109\/IJCNN.2009.5178619"},{"key":"jtd.2013010103-11","doi-asserted-by":"crossref","unstructured":"Street, N., & Kim, Y. (2001). A streaming ensemble algorithm (SEA) for large-scale classification. In Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Retrieved October 8, 2011, from http:\/\/www.liaad.up.pt\/~kdus\/kdus_5.html","DOI":"10.1145\/502512.502568"},{"key":"jtd.2013010103-12","author":"A.Tsymbal","year":"2004","journal-title":"The problem of concept drift: Definitions and related work. Technical Report, Department of Computer Science"},{"key":"jtd.2013010103-13","unstructured":"UCI machine learning repository: Data sets. (n.d.). Retrieve October 8, 2011, from http:\/\/archive.ics.uci.edu\/ml\/datasets.html"},{"key":"jtd.2013010103-14","doi-asserted-by":"crossref","unstructured":"Wang, H., Fan, W., Yu, P., & Han, J. (2003). Mining concept-drifting data streams using ensemble classifiers. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '03) (pp.226-235). New York, NY: ACM.","DOI":"10.1145\/956750.956778"},{"key":"jtd.2013010103-15","doi-asserted-by":"publisher","DOI":"10.1007\/BF00116900"},{"key":"jtd.2013010103-16","doi-asserted-by":"publisher","DOI":"10.1142\/S0219622006002258"},{"key":"jtd.2013010103-17","unstructured":"\u017dliobait\u0117, I. (2010). Adaptive training set formation. PhD thesis, Vilnius University."}],"container-title":["International Journal of Technology Diffusion"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=88914","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,1]],"date-time":"2022-06-01T21:04:20Z","timestamp":1654117460000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/jtd.2013010103"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2013,1,1]]},"references-count":18,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,1]]}},"URL":"https:\/\/doi.org\/10.4018\/jtd.2013010103","relation":{},"ISSN":["1947-9301","1947-931X"],"issn-type":[{"value":"1947-9301","type":"print"},{"value":"1947-931X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,1,1]]}}}