Abstract
Two hybrid feature selection methods (SFSP and SBSP) which are composed by combining the sequential forward selection and the sequential backward selection together with the principal component analysis developed by utilizing quadratic discriminant analysis classification algorithmic criteria so as to utilize in the diagnosis of breast cancer fast and effectively are presented in this study. The tenfold cross-validation method has been applied in the algorithm, which is utilized as criteria during the selection of the features. The dimension of the feature space for input has been decreased from 9 to 4 thanks to the selection of these two hybrid features. The Artificial Neural Networks have been used as classifier. The cross-validation method has been preferred also in the phase of this classification as in the case of the selection of the feature in order to increase the reliability of the result. The Wisconsin Breast Cancer Database obtained from the UCI has been utilized so as to determine the correctness of the system suggested. The values of the average correctness of the classification obtained by utilizing a tenfold cross-validation of the two hybrid systems developed earlier are found, respectively, as follows: for SFSP + NN, 97.57 % and for SBSP + NN, 98.57 %. SBSP + NN system has been observed that, among the studies carried out by implementing the cross-validation method for the breast cancer, the result appears to be very promising. The acquired results have revealed that this hybrid system applied by means of reducing dimension is an utilizable system in order to diagnose the diseases faster and more successfully.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Akay MF (2009) Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst Appl 36(2):3240–3247. doi:10.1016/j.eswa.2008.01.009
Organization WH (2011) Cancer http://www.who.int/cancer/en/. Accessed 5 Sept 2011
Chou SM, Lee TS, Shao YE, Chen IF (2004) Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression splines. Expert Syst Appl 27(1):133–142. doi:10.1016/j.eswa.2003.12.013
Yeh WC, Chang WW, Chung YY (2009) A new hybrid approach for mining breast cancer pattern using discrete particle swarm optimization and statistical method. Expert Syst Appl 36(4):8204–8211. doi:10.1016/j.eswa.2008.10.004
Goodman DE, Boggess L, Watkins A (2002) Artificial immune system classification of multiple-class problems. In: Proceedings of the Artificial Neural Networks in Engineering ANNIE 02, pp 179–183
Abonyi J, Szeifert F (2003) Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recognit Lett 24(14):2195–2207. doi:10.1016/S0167-8655(03)00047-3
Polat K, Gunes S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Process 17(4):694–701. doi:10.1016/j.dsp.2006.10.008
Karabatak M, Ince MC (2009) An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl 36(2):3465–3469. doi:10.1016/j.eswa.2008.02.064
Chen HL, Yang B, Liu J, Liu DY (2011) A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl 38(7):9014–9022. doi:10.1016/j.eswa.2011.01.120
Marcano-Cedeno A, Quintanilla-Dominguez J, Andina D (2011) WBCD breast cancer database classification applying artificial metaplasticity neural network. Expert Syst Appl 38(8):9573–9579. doi:10.1016/j.eswa.2011.01.167
Xu Y, Qi Z, Wang J (2011) Breast cancer diagnosis based on a kernel orthogonal transform. Neural Comput Appl. doi:10.1007/s00521-011-0547-0
Senapati MR, Mohanty AK, Dash S, Dash PK (2011) Local linear wavelet neural network for breast cancer recognition. Neural Comput Appl. doi:10.1007/s00521-011-0670-y
Ubeyli ED (2007) Implementing automated diagnostic systems for breast cancer detection. Expert Syst Appl 33(4):1054–1062. doi:10.1016/j.eswa.2006.08.005
Wolberg WH (1991) Wisconsin Breast Cancer Database http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.names. Accessed 2 Sept 2011
Sasikala M, Kumaravel N (2005) Comparison of feature selection techniques for detection of malignant tumor in brain images. In: Indicon 2005 Proceedings:212–215
Zhu ZX, Ong YS, Dash M (2007) Wrapper-filter feature selection algorithm using a memetic framework. IEEE Trans Syst Man Cybern Part B Cybern 37(1):70–76. doi:10.1109/Tsmcb.2006.883267
Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1–2):273–324
Bermejo P, Gamez JA, Puerta JM (2011) A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets. Pattern Recognit Lett 32(5):701–711. doi:10.1016/j.patrec.2010.12.016
Pudil P, Novovicova J, Kittler J (1994) Floating search methods in feature-selection. Pattern Recognit Lett 15(11):1119–1125
Ladha L, Deepa T (2011) Feature selection methods and algorithms. Int J Comput Sci Eng (IJCSE) 3(5):1787–1797
Kim KS, Choi HH, Moon CS, Mun CW (2011) Comparison of k-nearest neighbor, quadratic discriminant and linear discriminant analysis in classification of electromyogram signals based on the wrist-motion directions. Curr Appl Phys 11(3):740–745. doi:10.1016/j.cap.2010.11.051
Jolliffe IT (2002) Principal component analysis, 2nd edn. Springer, New York
Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Calisir D, Dogantekin E (2011) A new intelligent hepatitis diagnosis system: PCA-LSSVM. Expert Syst Appl 38(8):10705–10708. doi:10.1016/j.eswa.2011.01.014
Sagiroglu S, Yilmaz N (2009) Web-based mobile robot platform for real-time exercises. Expert Syst Appl 36(2):3153–3166. doi:10.1016/j.eswa.2008.01.046
Ubeyli ED, Guler I (2004) Multilayer perceptron neural networks to compute quasistatic parameters of asymmetric coplanar waveguides. Neurocomputing 62:349–365. doi:10.1016/j.neucom.2004.04.005
Yıldırım H, Altınsoy H, Barışçı N, Ergün U, Oğur E, Hardalaç F, Güler İ (2004) Classification of the frequency of carotid stenosis with MLP and RBF neural networks in patients with coroner artery disease. J Med Syst 28(6):591–601
Haykin S (1994) Neural networks: a comprehensive foundation. Macmillan College Publishing Company, New York
Guler I, Ubeyli ED (2005) ECG beat classifier designed by combined neural network model. Pattern Recognit 38(2):199–208. doi:10.1016/j.patcog.2004.06.009
Oztemel E (2003) Yapay sinir ağları (Artificial Neural Networks). Papatya Publisher, Istanbul
Lee JD (1997) Object recognition using a neural network with optimal feature extraction. Math Comput Modell 25(12):105–117
Moller MF (1993) A Scaled Conjugate-Gradient Algorithm for Fast Supervised Learning. Neural Netw 6(4):525–533
Francois D, Rossi F, Wertz V, Verleysen M (2007) Resampling methods for parameter-free and robust feature selection with mutual information. Neurocomputing 70:1276–1288
Diamantidis NA, Karlis D, Giakoumakis EA (2000) Unsupervised stratification of cross-validation for accuracy estimation. Artificial Intell 116:1–16
Acknowledgments
The authors are grateful to Selçuk University Scientific Research Projects Coordinatorship for support of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Uzer, M.S., Inan, O. & Yılmaz, N. A hybrid breast cancer detection system via neural network and feature selection based on SBS, SFS and PCA. Neural Comput & Applic 23, 719–728 (2013). https://doi.org/10.1007/s00521-012-0982-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-012-0982-6