Abstract
Finding feature genes in microarray expression profiles is an important problem in microarray research. This paper proposes a hybrid algorithm that combines a support vector machine (SVM) with a genetic algorithm (GA). We first use a Least Squares Support Vector Machine to obtain a candidate feature gene subset, filtering out most genes unrelated to disease according to a chosen significance level, gene importance, and classification efficiency. We then apply an improved genetic algorithm to carry out feature selection, using information entropy as the fitness function. Finally, we apply the proposed feature selection algorithm to two microarray expression data sets and evaluate the feature gene subsets obtained under different conditions. Simulation results show that the hybrid algorithm achieves good classification efficiency and identifies important disease-related genes.
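The second stage described above, a genetic algorithm whose fitness is based on information entropy, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define the fitness function or the GA improvements, so the sketch assumes the fitness is the information gain of the class labels given the selected genes, assumes expression values have already been binarized to 0/1, and adds a hypothetical size penalty (`penalty`) to favour small subsets. The LS-SVM filtering stage is omitted, and all function names are illustrative.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def conditional_entropy(X, y, genes):
    """H(y | expression pattern of the chosen genes); X holds 0/1 values (assumption)."""
    groups = {}
    for row, label in zip(X, y):
        groups.setdefault(tuple(row[g] for g in genes), []).append(label)
    return sum(len(g) / len(y) * entropy(g) for g in groups.values())


def fitness(mask, X, y, penalty=0.01):
    """Assumed fitness: information gain minus a hypothetical subset-size penalty."""
    genes = [i for i, bit in enumerate(mask) if bit]
    if not genes:
        return 0.0
    gain = entropy(y) - conditional_entropy(X, y, genes)
    return gain - penalty * len(genes)


def select_genes(X, y, pop_size=20, generations=30, p_mut=0.05, seed=0):
    """Plain elitist GA over bit masks (one bit per gene); needs at least 2 genes."""
    rng = random.Random(seed)
    n_genes = len(X[0])
    pop = [[rng.randint(0, 1) for _ in range(n_genes)] for _ in range(pop_size)]

    def score(mask):
        return fitness(mask, X, y)

    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        elite = pop[: pop_size // 2]          # keep the better half unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)       # pick two distinct parents
            cut = rng.randrange(1, n_genes)   # one-point crossover
            child = a[:cut] + b[cut:]
            # flip each bit with probability p_mut
            child = [bit ^ (rng.random() < p_mut) for bit in child]
            children.append(child)
        pop = elite + children
    best = max(pop, key=score)
    return [i for i, bit in enumerate(best) if bit]


# Toy data: gene 0 matches the class label exactly, genes 1 and 2 carry no information.
X_toy = [[0, 0, 1], [0, 1, 0], [1, 0, 1], [1, 1, 0]]
y_toy = [0, 0, 1, 1]
selected = select_genes(X_toy, y_toy)
```

On the toy data, any mask that includes gene 0 attains the full information gain, and the size penalty makes the single-gene mask the highest-scoring candidate. In a real pipeline the input to `select_genes` would be the reduced gene set left by the first filtering stage, not the full profile.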
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xiong, W., Zhang, C., Zhou, C., Liang, Y. (2006). Selection for Feature Gene Subset in Microarray Expression Profiles Based on a Hybrid Algorithm Using SVM and GA. In: Min, G., Di Martino, B., Yang, L.T., Guo, M., Rünger, G. (eds) Frontiers of High Performance Computing and Networking – ISPA 2006 Workshops. ISPA 2006. Lecture Notes in Computer Science, vol 4331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11942634_66
DOI: https://doi.org/10.1007/11942634_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49860-5
Online ISBN: 978-3-540-49862-9
eBook Packages: Computer Science, Computer Science (R0)