Abstract
The paper presents the data dimensionality reduction in the classification process, with a special presentation of using the ability of features weighting by determining the level of importance of a given attribute in the data vector. This reduction was implemented using the Forest Optimization Algorithm (FOA) and the use of a classifier allowing to enter the importance of each attribute for a data vector. The paper presents both, a description of the capability of using the FOA algorithm as well as the possibility of introducing modifications which allows to regulate the objective function between the obtained classification result and the number of reduced features. The conducted tests and obtained results were also presented. At the end of paper, a summary and the final conclusions are provided.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., Swami, A.: Database mining: a performance perspective. IEEE Trans. Knowl. Data Eng. 5(6), 914–925 (1993)
Aha, D.W., Kibler, D., Albert, M.K.: Instance-based learning algorithms. Mach. Learn. 6(1), 37–66 (1991)
Aha, D.W.: Feature weighting for lazy learning algorithms. In: Feature Extraction, Construction and Selection, pp. 13–32. Springer (1998)
Arie, B.D.: Comparison of classification accuracy using cohen’s weighted kappa. Expert. Syst. Appl. 34(2), 825–832 (2008)
Bach, M., Werner, A.: Cost-sensitive feature selection for class imbalance problem. In: International Conference on Information Systems Architecture and Technology, pp. 182–194. Springer (2017)
Costa, E., Lorena, A., Carvalho, A., Freitas, A.: A review of performance evaluation measures for hierarchical classifiers. In: Evaluation Methods for Machine Learning II: Papers from the AAAI-2007 Workshop, pp. 1–6 (2007)
Doak, J.: An evaluation of feature selection methods and their application to computer security. Technical Report, Department of Computer Science, University of California, Davis CA (1992)
Gasca, E., Sánchez, J.S., Alonso, R.: Eliminating redundancy and irrelevance using a new mlp-based feature selection method. Pattern Recognit. 39(2), 313–315 (2006)
Ghaemi, M., Feizi-Derakhshi, M.R.: Forest optimization algorithm. Expert. Syst. Appl. 41(15), 6676–6687 (2014)
Ghaemi, M., Feizi-Derakhshi, M.R.: Feature selection using forest optimization algorithm. Pattern Recognit. 60, 121–129 (2016)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3(Mar), 1157–1182 (2003)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. ACM SIGKDD Explor. Newsl. 11(1), 10–18 (2009)
Kostrzewa, D., Brzeski, R.: The data dimensionality reduction in the classification process through greedy backward feature elimination. In: International Conference on Man–Machine Interactions, pp. 397–407. Springer (2017)
Kostrzewa, D., Brzeski, R., Kubanski, M.: The classification of music by the genre using the knn classifier. In: International Conference: Beyond Databases, Architectures and Structures, pp. 233–242. Springer (2018)
Liu, H., Motoda, H.: Feature Extraction, Construction and Selection: A Data Mining Perspective, vol. 453. Springer (1998)
Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining, vol. 454. Springer (2012)
Özşen, S., Güneş, S.: Attribute weighting via genetic algorithms for attribute weighted artificial immune system (awais) and its application to heart disease and liver disorders problems. Expert. Syst. Appl. 36(1), 386–392 (2009)
Papakostas, G., Koulouriotis, D., Polydoros, A., Tourassis, V.: Evolutionary feature subset selection for pattern recognition applications. In: Evolutionary Algorithms. InTech (2011)
Powers, D.M.: Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation (2011)
Rosner, A., Weninger, F., Schuller, B., Michalak, M., Kostek, B.: Influence of low-level features extracted from rhythmic and harmonic sections on music genre classification. In: Man-Machine Interactions, vol. 3, pp. 467–473. Springer (2014)
Tosun, A., Turhan, B., Bener, A.B.: Feature weighting heuristics for analogy-based effort estimation models. Expert. Syst. Appl. 36(7), 10325–10333 (2009)
Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)
UCI Machine Learning Repository: Dermatology data set. Irvine, CA: University of California, School of Information and Computer Science. http://archive.ics.uci.edu/ml/datasets/dermatology
Wu, X., Kumar, V., Quinlan, J.R., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G.J., Ng, A., Liu, B., Philip, S.Y., et al.: Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008)
Yang, J., Honavar, V.: Feature subset selection using a genetic algorithm. In: Feature Extraction, Construction and Selection, pp. 117–136. Springer (1998)
Yang, Z.M., He, J.Y., Shao, Y.H.: Feature selection based on linear twin support vector machines. Procedia Comput. Sci. 17, 1039–1046 (2013)
Acknowledgements
This work was supported by BKM-509/RAU2/2017 (DK) and BK-213/RAu2/2018 (RB) grants from the Institute of Informatics, Silesian University of Technology, Gliwice, Poland.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kostrzewa, D., Brzeski, R. (2020). The Data Dimensionality Reduction and Features Weighting in the Classification Process Using Forest Optimization Algorithm. In: Huk, M., Maleszka, M., Szczerbicki, E. (eds) Intelligent Information and Database Systems: Recent Developments. ACIIDS 2019. Studies in Computational Intelligence, vol 830. Springer, Cham. https://doi.org/10.1007/978-3-030-14132-5_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-14132-5_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14131-8
Online ISBN: 978-3-030-14132-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)