Concepts Seeds Gathering and Dataset Updating Algorithm for Handling Concept Drift | IGI Global Scientific Publishing

Nabil M. Hewahi (Computer Science Department, University of Bahrain, Zallaq, Bahrain) and Ibrahim M. Elbouhissi (Computer Science Department, Islamic University of Gaza Palestine, Gaza City, Israel)
Copyright: © 2015 | Volume: 7 | Issue: 2 | Pages: 29
ISSN: 1941-6296|EISSN: 1941-630X|EISBN13: 9781466677227|DOI: 10.4018/IJDSST.2015040103
MLA

Hewahi, Nabil M., and Ibrahim M. Elbouhissi. "Concepts Seeds Gathering and Dataset Updating Algorithm for Handling Concept Drift." IJDSST, vol. 7, no. 2, 2015, pp. 29-57. https://doi.org/10.4018/IJDSST.2015040103

APA

Hewahi, N. M. & Elbouhissi, I. M. (2015). Concepts Seeds Gathering and Dataset Updating Algorithm for Handling Concept Drift. International Journal of Decision Support System Technology (IJDSST), 7(2), 29-57. https://doi.org/10.4018/IJDSST.2015040103

Chicago

Hewahi, Nabil M., and Ibrahim M. Elbouhissi. "Concepts Seeds Gathering and Dataset Updating Algorithm for Handling Concept Drift," International Journal of Decision Support System Technology (IJDSST) 7, no. 2 (2015): 29-57. https://doi.org/10.4018/IJDSST.2015040103

Abstract

In data mining, a change in the data distribution over time is known as concept drift. In this research, the authors introduce a new approach called the Concepts Seeds Gathering and Dataset Updating algorithm (CSG-DU), which gives traditional classification models the ability to adapt to concept drift as time passes. CSG-DU discovers new concepts in a data stream and aims to increase classification accuracy, with any classification model, when the underlying concepts change. The proposed approach has been tested on synthetic and real datasets. The experiments show that applying the authors' approach raised classification accuracy from low values to high, acceptable ones. Finally, CSG-DU was compared with the Set Formation for Delayed Labeling algorithm (SFDL), an approach that handles sudden and gradual concept drift. CSG-DU outperforms SFDL in terms of classification accuracy.
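
To make the problem setting concrete, the following is a minimal sketch of sudden concept drift on a synthetic one-dimensional stream. It is not the CSG-DU algorithm, whose steps are not described in this abstract. The threshold classifier, the helper names (make_batch, fit_threshold, accuracy), and the drift from a decision boundary of 0.3 to 0.7 are all assumptions made for illustration. The sketch only shows why a model trained on an old concept loses accuracy after drift, and why retraining on an updated dataset can restore it.

```python
import random

def make_batch(n, boundary, rng):
    """Draw n points uniformly from [0, 1); the label is 1 when x exceeds the
    concept's decision boundary (a hypothetical, illustrative concept)."""
    return [(x, int(x > boundary)) for x in (rng.random() for _ in range(n))]

def fit_threshold(batch):
    """Train a one-parameter classifier: the midpoint between the largest
    class-0 point and the smallest class-1 point in the training batch."""
    zeros = [x for x, y in batch if y == 0]
    ones = [x for x, y in batch if y == 1]
    return (max(zeros) + min(ones)) / 2

def accuracy(threshold, batch):
    """Fraction of points in the batch that the threshold classifies correctly."""
    hits = sum(int(x > threshold) == y for x, y in batch)
    return hits / len(batch)

rng = random.Random(0)  # fixed seed so the run is repeatable

# Old concept: boundary at 0.3. New concept after sudden drift: boundary at 0.7.
old_train = make_batch(500, 0.3, rng)
old_test = make_batch(500, 0.3, rng)
new_train = make_batch(500, 0.7, rng)
new_test = make_batch(500, 0.7, rng)

# A model trained before the drift does well on the old concept...
model = fit_threshold(old_train)
acc_before_drift = accuracy(model, old_test)

# ...but misclassifies the points that fall between the old and new boundaries.
acc_after_drift = accuracy(model, new_test)

# Assumption: replacing the training set with recent labeled data stands in
# for a "dataset updating" step; the paper's actual procedure may differ.
updated_model = fit_threshold(new_train)
acc_after_update = accuracy(updated_model, new_test)

print(f"before drift: {acc_before_drift:.2f}")
print(f"after drift, stale model: {acc_after_drift:.2f}")
print(f"after drift, updated model: {acc_after_update:.2f}")
```

In this toy setting the stale model is wrong on roughly the share of points lying between the two boundaries, which is the kind of accuracy drop that drift-handling approaches aim to recover from.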
