Abstract
In this paper, the authors use multilayer perceptron (MLP) on hospital surveillance data to categorize admitted patients according to their critical conditions which can be classified as - low, medium and high, to distinguish the criticality. The paper addresses the over-fitting problem in the unbalanced dataset using two distinct approaches since the frequency of instances of the class ‘low’ is significantly higher than other classes. Besides trimming, the unbalanced dataset is balanced by introducing the Synthetic Minority Over-sampling Technique (SMOTE) algorithm coupled with Locally Linear Embedding (LLE). We have constructed three models and applied neural classifications and compared the performances with the decision tree based models that already exist in literature. We show that one of our models outperforms prior models in classification, contingent upon performance time trade-off, giving us an efficient model that handles large scale unbalanced dataset efficiently with standard classification performance. The models developed in this research can become imperative tools to doctors during epidemics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
International Centre for Diarrhoeal Disease Research, Bangladesh: http://www.icddrb.org/org/orgunits.jsp?idDetails=103&searchID=103. (retrieved May 20, 2014)
Rahman, R.M., Hasan, F.R.M.: Using and comparing different decision tree classification techniques for mining ICDDR, B Hospital Surveillance data. Expert Syst. Appl. 38(9), 11421–11436 (2011)
Wang, J., Xu, M., Wang, H., Zhang, J.: Classification of imbalanced data by using the SMOTE algorithm and locally linear embedding. In: 8th International Conference on Signal Processing, vol. 3 (2006). doi:10.1109/ICOSP.2006.345752
Haykin, S.: Neural Networks: A Comprehensive Foundation (2nd ed.). Prentice Hall (1998)
Pang, S., Kim, D., Bang, S.Y.: Face membership authentication using SVM classification tree generated by membership-based LLE data partition. IEEE Transactions on Neural Networks 16(2), 436–446 (2005)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Int. Res. 16(1), 321–357 (2002)
Weka: The Data Mining Software in Java: http://www.cs.waikato.ac.nz/ml/weka/. (retrieved on May 20, 2014)
Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining, 1st edn. Addison-Wesley (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Firoze, A., Rahman, R.M. (2015). Mining ICDDR, B Hospital Surveillance Data Using Locally Linear Embedding Based SMOTE Algorithm and Multilayer Perceptron. In: Nguyen, N., Trawiński, B., Kosala, R. (eds) Intelligent Information and Database Systems. ACIIDS 2015. Lecture Notes in Computer Science(), vol 9011. Springer, Cham. https://doi.org/10.1007/978-3-319-15702-3_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-15702-3_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15701-6
Online ISBN: 978-3-319-15702-3
eBook Packages: Computer ScienceComputer Science (R0)