{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,3]],"date-time":"2024-08-03T01:33:58Z","timestamp":1722648838956},"reference-count":16,"publisher":"IGI Global","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,10]]},"abstract":"In practice, the dataset collected from data mining usually contains some missing values. It is common practice to perform case deletion by ignoring those data with missing values if the missing rate is certainly small. The aim of this paper is to answer the following question: When should one directly ignore sampled data with missing values? By using different types of datasets having various numbers of attributes, data samples, and classes, it is found that there are some specific patterns that can be considered for case deletion over different datasets without significant performance degradation. In particular, these patterns are extracted to act as the decision rules by a decision tree model. In addition, a comparison is made between cases with deletion and imputation over different datasets with the allowed missing rates and the decision rules. The results show that the classification performance results obtained by case deletion and imputation are similar, which demonstrates the reliability of the extracted decision rules.<\/jats:p>","DOI":"10.4018\/ijdwm.2017100104","type":"journal-article","created":{"date-parts":[[2017,8,15]],"date-time":"2017-08-15T17:16:10Z","timestamp":1502817370000},"page":"53-63","source":"Crossref","is-referenced-by-count":2,"title":["When Should We Ignore Examples with Missing Values?"],"prefix":"10.4018","volume":"13","author":[{"given":"Wei-Chao","family":"Lin","sequence":"first","affiliation":[{"name":"Asia University, Taichung, Taiwan"}]},{"given":"Shih-Wen","family":"Ke","sequence":"additional","affiliation":[{"name":"Chung Yuan Christian University, Taoyuan, Taiwan"}]},{"given":"Chih-Fong","family":"Tsai","sequence":"additional","affiliation":[{"name":"Department of Information Management, National Central University, Jhongli, Taiwan"}]}],"member":"2432","reference":[{"key":"IJDWM.2017100104-0","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-17103-1_60"},{"key":"IJDWM.2017100104-1","doi-asserted-by":"publisher","DOI":"10.1080\/713827181"},{"key":"IJDWM.2017100104-2","article-title":"Trends in data mining and knowledge discovery","author":"K. J.Cois","year":"2002","journal-title":"Knowledge discovery in advanced information systems"},{"key":"IJDWM.2017100104-3","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010395805406"},{"key":"IJDWM.2017100104-4","doi-asserted-by":"publisher","DOI":"10.1109\/TSMC.1979.4310090"},{"key":"IJDWM.2017100104-5","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2008.05.019"},{"key":"IJDWM.2017100104-6","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-009-0295-6"},{"key":"IJDWM.2017100104-7","doi-asserted-by":"publisher","DOI":"10.1109\/34.824819"},{"key":"IJDWM.2017100104-8","unstructured":"Jonsson, P., & Wohlin, C. (2004) An evaluation of k-nearest neighbor imputation using likert data. In Proceedings of theIEEE International Symposium on Software Metrics (pp. 108-118)."},{"key":"IJDWM.2017100104-9","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008334909089"},{"key":"IJDWM.2017100104-10","first-page":"1227","article-title":"Regression with missing X\u2019s: A review.","volume":"87","author":"R. J. A.Little","year":"1992","journal-title":"Journal of the American Statistical Association"},{"key":"IJDWM.2017100104-11","author":"R. J. A.Little","year":"1987","journal-title":"Statistical analysis with missing data"},{"key":"IJDWM.2017100104-12","doi-asserted-by":"publisher","DOI":"10.1177\/0013164487471002"},{"key":"IJDWM.2017100104-13","doi-asserted-by":"publisher","DOI":"10.1109\/32.962560"},{"issue":"1","key":"IJDWM.2017100104-14","first-page":"32","article-title":"Parimputation: From imputation and null-imputation to partially imputation.","volume":"9","author":"S.Zhang","year":"2008","journal-title":"IEEE Intelligent Informatics Bulletin"},{"key":"IJDWM.2017100104-15","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2010.99"}],"container-title":["International Journal of Data Warehousing and Mining"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=188490","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T23:49:56Z","timestamp":1651794596000},"score":1,"resource":{"primary":{"URL":"http:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJDWM.2017100104"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2017,10]]},"references-count":16,"journal-issue":{"issue":"4"},"URL":"https:\/\/doi.org\/10.4018\/ijdwm.2017100104","relation":{},"ISSN":["1548-3924","1548-3932"],"issn-type":[{"value":"1548-3924","type":"print"},{"value":"1548-3932","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,10]]}}}