Abstract
We develop a filter based feature selection approach in Multi-classification by optimizing the so called Generic Feature Selection (GeFS) measure and then using Multi Support Vector Machine (MSVM) classifiers. The problem is first formulated as a polynomial mixed 0-1 fractional programming and then equivalently transformed into a mixed 0-1 linear programming (M01LP) problem. DCA (Difference of Convex functions Algorithm), an innovative approach in nonconvex programming framework, is investigated to solve the M01LP problem. The proposed algorithm is applied on Intrusion Detection Systems (IDSs) and experiments are conducted through the benchmark KDD Cup 1999 dataset which contains millions of connection records audited and includes a wide variety of intrusions simulated in a military network environment. We compare our method with an embedded based method for MSVM using l 2 − l 0 regularizer. Preliminary numerical results show that the proposed algorithm is comparable with l 2 − l 0 regularizer MSVM on the ability of classification but requires less computation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
KDD Cup 1999 data set (1999), http://www.sigkdd.org/kddcup/index.php?section=1999&method=data
Chang, C.-T.: On the polynomial mixed 0-1 fractional programming problems. European Journal of Operational Research 131(1), 224–227 (2001)
Chang, C.-T.: An efficient linearization approach for mixed integer problems. European Journal of Operational Research 123, 652–659 (2000)
Nguyen, H.T., Franke, K., Petrovic, S.: Towards a generic feature-selection measure for intrusion detection. In: International Conference Pattern Recognition, pp. 1529–1532 (2010)
Nguyen, H.T., Franke, K., Petrovic, S.: Reliability in a feature-selection process for intrusion detection. In: Dai, H., Liu, J.N.K., Smirnov, E. (eds.) Reliable Knowledge Discovery, pp. 203–218. Springer US (2012)
Hall, M.: Correlation Based Feature Selection for Machine Learning. Doctoral Dissertation, University of Waikato, Department of Computer Science (1999)
Le Thi, H.A.: DC Programming and DCA, http://lita.sciences.univ-metz.fr/~lethi
Le Thi, H.A., Pham Dinh, T., Le, D.M.: Exact penalty in DC programming. Vietnam Journal of Mathematics 27(2), 169–178 (1999)
Le Thi, H.A., Nguyen, M.C.: Efficient Algorithms for Feature Selection in Multi-class Support Vector Machine. In: Nguyen, N.T., van Do, T., Thi, H.A. (eds.) ICCSAMA 2013. SCI, vol. 479, pp. 41–52. Springer, Heidelberg (2013)
Le, A.V., Le Thi, H.A., Nguyen, M.C., Zidna, A.: Network Intrusion Detection Based on Multi-Class Support Vector Machine. In: Nguyen, N.-T., Hoang, K., Jędrzejowicz, P. (eds.) ICCCI 2012, Part I. LNCS (LNAI), vol. 7653, pp. 536–543. Springer, Heidelberg (2012)
Le Thi, H.A., Pham Dinh, T., Huynh, V.N.: Exact penalty and Error Bounds in DC programming. Journal of Global Optimization 52(3), 509–535 (2012)
Chen, Y., Li, Y., Cheng, X.-Q., Guo, L.: Survey and Taxonomy of Feature Selection Algorithms in Intrusion Detection System. In: Lipmaa, H., Yung, M., Lin, D. (eds.) Inscrypt 2006. LNCS, vol. 4318, pp. 153–167. Springer, Heidelberg (2006)
Peng, H., Long, F., Ding, C.H.Q.: Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(8), 1226–1238 (2005)
Pham Dinh, T., Le Thi, H.A.: Convex analysis approach to dc programming: Theory, algorithms and applications. Acta Mathematica Vietnamica 22(1), 289–357 (1997)
Pham Dinh, T., Le Thi, H.A.: Dc optimization algorithms for solving the trust region subproblem. SIAM J. Optimization 8, 476–505 (1998)
Pham Dinh, T., Le Thi, H.A.: Recent advances on DC programming and DCA. To appear in Transactions on Computational Collective Intelligence, 37 pages. Springer (2013)
Weston, J., Watkins, C.: Support Vector Machines for Multi-Class Pattern Recognition. In: Proceedings - European Symposium on Artificial Neural Networks, ESANN 1999, pp. 219–224. D-Facto public (1999)
Le Thi, H.A., Pham Dinh, T.: DC programming in Communication Systems: challenging problems and methods. Vietnam Journal of Computer Science, invited issue, 21 pages (2013), doi:10.1007/s40595-013-0010-5
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Le Thi, H.A., Le, A.V., Vo, X.T., Zidna, A. (2014). A Filter Based Feature Selection Approach in MSVM Using DCA and Its Application in Network Intrusion Detection. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds) Intelligent Information and Database Systems. ACIIDS 2014. Lecture Notes in Computer Science(), vol 8398. Springer, Cham. https://doi.org/10.1007/978-3-319-05458-2_42
Download citation
DOI: https://doi.org/10.1007/978-3-319-05458-2_42
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05457-5
Online ISBN: 978-3-319-05458-2
eBook Packages: Computer ScienceComputer Science (R0)