Abstract
This paper is focused on multimodal approaches to malware detection, which have not been explored widely in related works. We use static code-based features and dynamic power-based, network traffic-based, and system log-based features, and propose multimodal approaches that use feature level and decision level fusion. Our findings include: (1) For all considered learners, power-based features alone were very good predictors; some learners performed well using only network traffic-based features. (2) For most standard supervised learning algorithms, feature level fusion improved all performance metrics. If Recall is the highest priority, Random Forest or J48 with feature level fusion should be selected. (3) The proposed deep neural network with decision level fusion had lower Recall, but higher Precision and (1-FPR) values, which led to comparable F-score and better G-score than the Random Forest with feature level fusion. In addition to improving classification performance, multimodal approaches make malware evasion of detection much harder.
This work was done while Jarilyn Hernández Jiménez was affiliated with West Virginia University and is included in her Ph.D. dissertation [1].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hernández Jiménez, J.: Multimodal approach for malware detection. Ph.D. thesis, West Virginia University, Morgantown, WV (April 2019)
Shijo, P., Salim, A.: Integrated static and dynamic analysis for malware detection. Procedia Comput. Sci. 46, 804–811 (2015)
Ahmadi, M., et al.: Malware detection by behavioural sequential patterns. Comput. Fraud Secur. J. 2013, 11–19 (2013)
Baltrušaitis, T., et al.: Multimodal machine learning: a survey and taxonomy. IEEE Trans. Pattern Anal. Mach. Intell. 41, 423–443 (2019)
Atrey, P.K., Hossain, M.A., El Saddik, A., Kankanhalli, M.S.: Multimodal fusion for multimedia analysis: a survey. Multimed. Syst. 16, 345–379 (2010)
Yan, P., Yan, Z.: A survey on dynamic mobile malware detection. Softw. Qual. J. 26, 891–919 (2018)
Ye, Y., et al.: A survey on malware detection using data mining techniques. ACM CSUR 50, 41:1–41:40 (2017)
Hernández Jiménez, J., et al.: Malware detection on general-purpose computers using power consumption monitoring: a proof of concept and case study. arXiv preprint arXiv:1705.01977 (2017)
Hernández Jiménez, J., Goseva-Popstojanova, K.: The effect on network flows-based features and training set size on malware detection. In: 17th IEEE International Symposium on NCA, pp. 1–9 (2018)
Hernández Jiménez, J., Goseva-Popstojanova, K.: Malware detection using power consumption and network traffic data. In: 2nd International Conference on Data Intelligence and Security (ICDIS), pp. 53–59. IEEE (2019)
Dawson, J.A., et al.: Rootkit detection through phase-space analysis of power voltage measurements. In: 12th International Conference on MALWARE, pp. 19–27 (2017)
Luckett, P., et al.: Identifying stealth malware using CPU power consumption and learning algorithms. J. Comput. Secur. 26, 589–613 (2018)
Bridges, R., Hernández Jiménez, J., et al.: Towards malware detection via CPU power consumption: data collection design and analytics. In: 17th IEEE International Conference on TrustCom, pp. 1680–1684 (2018)
Wei, S., Aysu, A., Orshansky, M., Gerstlauer, A., Tiwari, M.: Using power-anomalies to counter evasive micro-architectural attacks in embedded systems. In: 2019 IEEE International Symposium on Hardware Oriented Security and Trust (HOST), pp. 111–120. IEEE (2019)
Dollah, R.F.M., et al.: Machine learning for HTTP botnet detection using classifier algorithms. J. Telecommun. Electron. Comput. Eng. 10, 27–30 (2018)
Bekerman, D., et al.: Unknown malware detection using network traffic classification. In: IEEE Conference on CNS, pp. 134–142 (2015)
Prasse, P., et al.: Malware detection by analysing encrypted network traffic with neural networks. In: ECML PKDD, pp. 73–88 (2017)
Palmieri, F., Fiore, U., Castiglione, A.: A distributed approach to network anomaly detection based on independent component analysis. Concurr. Comput. Pract. Exp. 26, 1113–1129 (2014)
Sainju, A.M., Atkison, T.: An experimental analysis of Windows log events triggered by malware. In: ACM SouthEast Conference, pp. 195–198 (2017)
Ozsoy, M., et al.: Malware-aware processors: a framework for efficient online malware detection. In: International Symposium on HPCA, pp. 651–661 (2015)
De Lorenzo, A., Martinelli, F., Medvet, E., Mercaldo, F., Santone, A.: Visualizing the outcome of dynamic analysis of Android malware with VizMal. J. Inf. Secur. Appl. 50, 102423 (2020)
Fraley, J.B., Figueroa, M.: Polymorphic malware detection using topological feature extraction with data mining. In: IEEE SoutheastCon, pp. 1–7 (2016)
Khan, R.U., et al.: Analysis of ResNet and GoogleNet models for malware detection. J. Comput. Virol. Hacking Tech. 15, 29–37 (2018)
Kapoor, A., Dhavale, S.: Control flow graph based multiclass malware detection using bi-normal separation. Def. Sci. J. 66, 138–145 (2016)
Santos, I., et al.: OPEM: a static-dynamic approach for machine-learning-based malware detection. In: International Joint Conference CISIS Special Sessions, pp. 271–280. Springer (2013)
Anderson, B., et al.: Improving malware classification: bridging the static/dynamic gap. In: Workshop on AISec, pp. 3–14 (2012)
Yan, G., et al.: Exploring discriminatory features for automated malware classification. In: International Conference on DIMVA, pp. 41–61. Springer (2013)
Stiborek, J., et al.: Multiple instance learning for malware classification. Expert Syst. Appl. 93, 346–357 (2018)
Sheen, S., Anitha, R., Natarajan, V.: Android based malware detection using a multifeature collaborative decision fusion approach. Neurocomputing 151, 905–912 (2015)
Chen, L., Hou, S., Ye, Y.: SecureDroid: enhancing security of machine learning-based detection against adversarial Android malware attacks. In: 33rd ACSA Conference, pp. 362–372. ACM (2017)
Yerima, S.Y., Sezer, S.: DroidFusion: a novel multilevel classifier fusion approach for Android malware detection. IEEE Trans. Cybern. 49, 453–466 (2018)
Wang, X., Zhang, D., Su, X., Li, W.: Mlifdect: Android malware detection based on parallel machine learning and information fusion. Secur. Commun. Netw. 2017, 1–14 (2017)
Zhang, Y., et al.: Based on multi-features and clustering ensemble method for automatic malware categorization. In: IEEE Trustcom/BigDataSE/ICESS, pp. 73–82 (2017)
Kim, T., et al.: A multimodal deep learning method for Android malware detection using various features. IEEE Trans. Inf. Forensics Secur. 14, 773–788 (2019)
Wang, C., et al.: A malware detection method based on sandbox, binary instrumentation and multidimensional feature extraction. In: International Conference on BWCCA, pp. 427–438. Springer (2017)
Clevert, D.A., et al.: Fast and accurate deep network learning by exponential linear units. arXiv preprint arXiv:1511.07289 (2015)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Srivastava, N., et al.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
Hsiao, S.C., Kao, D.Y., Liu, Z.Y., Tso, R.: Malware image classification using one-shot learning with Siamese networks. Procedia Comput. Sci. 159, 1863–1871 (2019)
Feng, S., Zhou, H., Donga, H.: Using deep neural network with small dataset to predict material defects. Mater. Des. 162, 300–310 (2019)
Ghosh, A., et al.: On the robustness of decision tree learning under label noise. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 685–697. Springer (2017)
Nettleton, D.F., Orriols-Puig, A., Fornells, A.: A study of the effect of different types of noise on the precision of supervised learning techniques. Artif. Intell. Rev. 33, 275–306 (2010)
Acknowledgments
This work is funded by the National Science Foundation under the grant CNS-1618629.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Hernández Jiménez, J.M., Goseva-Popstojanova, K. (2020). Using Four Modalities for Malware Detection Based on Feature Level and Decision Level Fusion. In: Barolli, L., Amato, F., Moscato, F., Enokido, T., Takizawa, M. (eds) Advanced Information Networking and Applications. AINA 2020. Advances in Intelligent Systems and Computing, vol 1151. Springer, Cham. https://doi.org/10.1007/978-3-030-44041-1_117
Download citation
DOI: https://doi.org/10.1007/978-3-030-44041-1_117
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-44040-4
Online ISBN: 978-3-030-44041-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)