Abstract
In the recent years, the number of web logs, and the amount of opinionated data on the World Wide Web, have been grown substantially. The ability to determine the political orientation of an article automatically can be beneficial in many areas from academia to security. However, the sentiment classification of web log posts (political web log posts in particular), is apparently more complex than the sentiment classification of conventional text. In this paper, a supervised machine learning with two feature extraction techniques Term Frequency (TF) and Term Frequency-Inverse Document Frequency (TF-IDF) are used for the classification process. For investigation, SVM with four kernels for supervised machine learning have been employed. Subsequent to testing, the results reveal that the linear with TF achieved the results in accuracy of 91.935% also with TF-IDF achieved the 95.161%. The linear kernel was deemed the most suitable for our model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)
Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, vol. 2, no. 2010, pp. 627–666 (2010)
Liu, B.: Sentiment analysis and opinion mining. In: Synthesis Lectures on Human Language Technologies, vol. 5, no. 1, pp. 1–167 (2012)
Tan, C., Lee, L., Tang, J., Jiang, L., Zhou, M., Li, P.: User-level sentiment analysis incorporating social networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1397–1405. ACM (2011)
Xia, R., Zong, C., Li, S.: Ensemble of feature sets and classification algorithms for sentiment classification. Inf. Sci. 181(6), 1138–1152 (2011)
Prabowo, R., Thelwall, M.: Sentiment analysis: a combined approach. J. Inform. 3(2), 143–157 (2009)
Brahimi, B., Touahria, M., Tari, A.: Data and text mining techniques for classifying Arabic tweet polarity. J. Dig. Inf. Manag. 14(1), 15–25 (2016)
Catal, C., Nangir, M.: A sentiment classification model based on multiple classifiers. Appl. Soft Comput. 50, 135–141 (2017)
Balazs, J.A., Velásquez, J.D.: Opinion mining and information fusion: a survey. Inf. Fusion 27, 95–110 (2016)
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)
Ravi, K., Ravi, V.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl.-Based Syst. 89, 14–46 (2015)
Zhou, S., Chen, Q., Wang, X.: Fuzzy deep belief networks for semi-supervised sentiment classification. Neurocomputing 131, 312–322 (2014)
Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics (2002)
Sindhwani, V., Melville, P.: Document-word co-regularization for semi-supervised sentiment analysis. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 1025–1030. IEEE (2008)
Nasukawa, T., Yi, J.: Sentiment analysis: capturing favorability using natural language processing. In: Proceedings of the 2nd International Conference on Knowledge Capture, pp. 70–77. ACM (2003)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics, p. 271. Association for Computational Linguistics (2004)
Hearst, M.A.: Direction-based text interpretation as an information access refinement. In: Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval, pp. 257–274 (1992)
Sack, W.: On the computation of point of view. In: AAAI, p. 1488 (1994)
Huettner, A., Subasic, P.: Fuzzy typing for document management. In: ACL 2000 Companion Volume: Tutorial Abstracts and Demonstration Notes, pp. 26–27 (2000)
Das, S., Chen, M.: Yahoo! for Amazon: extracting market sentiment from stock message boards. In: Proceedings of the Asia Pacific Finance Association Annual Conference (APFA), Bangkok, Thailand, vol. 35, p. 43 (2001)
Wang, Z., Tong, V.J.C., Ruan, P., Li, F.: Lexicon knowledge extraction with sentiment polarity computation. In: 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW), pp. 978–983. IEEE (2016)
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics (1997)
Turney, P.D., Littman, M.L.: Unsupervised learning of semantic orientation from a hundred-billion-word corpus. arXiv preprint cs/0212012 (2002)
Oussous, A., Lahcen, A.A., Belfkih, S.: Impact of text pre-processing and ensemble learning on Arabic sentiment analysis. In: Proceedings of the 2nd International Conference on Networking, Information Systems & Security, p. 65. ACM (2019)
Hardeniya, N., Perkins, J., Chopra, D., Joshi, N., Mathur, I.: Natural Language Processing: Python and NLTK. Packt Publishing Ltd., Birmingham (2016)
Mustafa, M., Eldeen, A.S., Bani-Ahmad, S., Elfaki, A.O.: A comparative survey on arabic stemming: approaches and challenges. Intell. Inf. Manag. 9(02), 39 (2017)
Abooraig, R., Al-Zu’bi, S., Kanan, T., Hawashin, B., Al Ayoub, M., Hmeidi, I.: Automatic categorization of Arabic articles based on their political orientation. Digit. Investig. 25, 24–41 (2018)
Taghva, K., Elkhoury, R., Coombs, J.: Arabic stemming without a root dictionary. In: International Conference on Information Technology: Coding and Computing (ITCC 2005)-Volume II, vol. 1, pp. 152–157. IEEE (2005)
Aggarwal, C.C., Zhai, C.: A survey of text classification algorithms. In: Aggarwal, C., Zhai, C. (eds.) Mining Text Data, pp. 163–222. Springer, Boston (2012). https://doi.org/10.1007/978-1-4614-3223-4_6
Alowaidi, S., Saleh, M., Abulnaja, O.: Semantic sentiment analysis of arabic texts. Int. J. Adv. Comput. Sci. Appl. 8(2), 256–262 (2017)
Abd, D.H., Sadiq, A.T., Abbas, A.R.: A new framework for automatic extraction polarity and target of articles (2019)
Deng, N., Tian, Y.: Support Vector Machines: A New Method in Data Mining. Science Press, Beijing (2004)
Al-Mejibli, I.S., Abd, D.H., Alwan, J.K., Rabash, A.J.: Performance evaluation of kernels in support vector machine. In: 2018 1st Annual International Conference on Information and Sciences (AiCIS), pp. 96–101. IEEE (2018)
Khalaf, M., et al.: An application of using support vector machine based on classification technique for predicting medical data sets. In: Huang, D.-S., Jo, K.-H., Huang, Z.-K. (eds.) ICIC 2019. LNCS, vol. 11644, pp. 580–591. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-26969-2_55
Khalaf, M., et al.: Recurrent neural network architectures for analysing biomedical data sets. In: 2017 10th International Conference on Developments in eSystems Engineering (DeSE), pp. 232–237. IEEE (2017)
Abd, D.H., Alwan, J.K., Ibrahim, M., Naeem, M.B.: The utilisation of machine learning approaches for medical data classification and personal care system management for sickle cell disease. In: Annual Conference on New Trends in Information & Communications Technology 2017 (2017)
Mayoraz, E., Alpaydin, E.: Support vector machines for multi-class classification. In: Mira, J., Sánchez-Andrés, Juan V. (eds.) IWANN 1999. LNCS, vol. 1607, pp. 833–842. Springer, Heidelberg (1999). https://doi.org/10.1007/BFb0100551
Acknowledgments
The authors would like to thank Al-Maarif University College and Dr. Falah Mubark Bardan for supporting this research.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Abd, D.H., Sadiq, A.T., Abbas, A.R. (2020). Classifying Political Arabic Articles Using Support Vector Machine with Different Feature Extraction. In: Khalaf, M., Al-Jumeily, D., Lisitsa, A. (eds) Applied Computing to Support Industry: Innovation and Technology. ACRIT 2019. Communications in Computer and Information Science, vol 1174. Springer, Cham. https://doi.org/10.1007/978-3-030-38752-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-38752-5_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38751-8
Online ISBN: 978-3-030-38752-5
eBook Packages: Computer ScienceComputer Science (R0)