Abstract
Emotion classification from text is the process of identifying and classifying emotions expressed in textual data. Emotions can be feelings such as anger, joy, suspense, sadness and neutral. Developing a machine learning model to identify emotions in a low-resourced language with a limited set of linguistic resources and annotated corpora is a challenge. This research proposes a Deep Learning Emotion Classification Framework to identify and classify emotions in low-resourced languages such as Hindi. The proposed framework combines a classification model and a low resource optimization technique in a novel way. An annotated corpus of Hindi short stories consisting of 20,304 sentences is used to train the models for predicting five categories of emotions: anger, joy, suspense, sadness, and neutral talk. To resolve the class imbalance in the dataset SMOTE technique is applied. The optimal classification model is selected through experimentation that compares machine learning models and pre-trained models. Machine learning and deep learning models are SVM, Logistic Regression, Random Forest, CNN, BiLSTM, and CNN+BiLSTM. The pre-trained models, mBERT, IndicBERT, and a hybrid model, mBERT+BiLSTM. The models are evaluated based on macro average recall, macro average precision, and macro average F1 score. Results demonstrate that the hybrid model mBERT+BiLSTM out perform other models with a test accuracy of 57%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Acheampong, F.A., Wenyu, C., Nunoo-Mensah, H.: Text-based emotion detection: advances, challenges, and opportunities. Eng. Rep. 2(7), e12189 (2020)
Das, A., Sharif, O., Hoque, M.M., Sarker, I.H.: Emotion classification in a resource constrained language using transformer-based approach (2021). arXiv preprint arXiv:2104.08613
Alam, T., Khan, A., Alam, F.: Bangla text classification using transformers (2020). arXiv preprint arXiv:2011.04446
Bharti, S.K., et al.: Text-based emotion recognition using deep learning approach. Comput. Intell. Neurosci. (2022)
Midhan, T.M., Selvaraj, P., Raju, M.H.K., Reddy, M.B.P., Bhaskar, T.: Classification of mental health and emotion of human from text using machine learning approaches. In: 2023 6th International Conference on Information Systems and Computer Networks (ISCON), pp. 1–7. IEEE, March 2023
Xu, D., Tian, Z., Lai, R., Kong, X., Tan, Z., Shi, W.: Deep learning based emotion analysis of microblog texts. Inf. Fusion 64, 1–11 (2020)
Kannan, E., Kothamasu, L.A.: Fine-tuning BERT based approach for multi-class sentiment analysis on twitter emotion data. Ingénierie des Systémes d’Information 27(1) (2022)
Sonu, S., Haque, R., Hasanuzzaman, M., Stynes, P., Pathak, P.: Identifying emotions in code mixed Hindi-English tweets. In: Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference, pp. 35–41. European Language Resources Association, Marseille, France (2022)
Li, A., Yi, S.: Emotion analysis model of microblog comment text based on CNN-BiLSTM. Comput. Intell. Neurosci. (2022)
Gou, Z., Li, Y.: Integrating BERT embeddings and BiLSTM for emotion analysis of dialogue. Comput. Intell. Neurosci. (2023)
Li, X., Lei, Y., Ji, S.: BERT-and BiLSTM-based sentiment analysis of online Chinese buzzwords. Future Internet 14(11), 332 (2022)
Ozturk, O., Ozcan, A.: Sentiment analysis in turkish using transformer-based deep learning models. In: Hemanth, D.J., Yigit, T., Kose, U., Guvenc, U. (eds.) The International Conference on Artificial Intelligence and Applied Mathematics in Engineering, vol. 7, pp. 1–15. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-31956-3_1
Ranathunga, S., Liyanage, I.U.: Sentiment analysis of Sinhala news comments. Trans. Asian Low-Resource Lang. Inf. Process. 20(4), 1–23 (2021)
Ucan, A., Dörterler, M., Akçapinar Sezer, E.: A study of Turkish emotion classification with pretrained language models. J. Inf. Sci. 48(6), 857–865 (2022)
Kumar, Y., Mahata, D., Aggarwal, S., Chugh, A., Maheshwari, R., Shah, R.R.: BHAAV-A text corpus for emotion analysis from Hindi stories (2019). arXiv preprint arXiv:1910.04073
Kannan, R.R., Rajalakshmi, R., Kumar, L.: IndicBERT based approach for sentiment analysis on code-mixed Tamil tweets (2021)
Fischer, M., Haque, R., Stynes, P., Pathak, P.: Identifying fake news in Brazilian Portuguese. In: Rosso, P., Basile, V., Martínez, R., Métais, E., Meziane, F. (eds.) International Conference on Applications of Natural Language to Information Systems, vol. 13286, pp. 111–118. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-08473-7_10
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Manisha, Clifford, W., McLaughlin, E., Stynes, P. (2023). A Deep Learning Emotion Classification Framework for Low Resource Languages. In: Goyal, V., Kumar, N., Bhowmick, S.S., Goyal, P., Goyal, N., Kumar, D. (eds) Big Data and Artificial Intelligence. BDA 2023. Lecture Notes in Computer Science, vol 14418. Springer, Cham. https://doi.org/10.1007/978-3-031-49601-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-49601-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-49600-4
Online ISBN: 978-3-031-49601-1
eBook Packages: Computer ScienceComputer Science (R0)