End-to-End Aspect Extraction and Aspect-Based Sentiment Analysis Framework for Low-Resource Languages | SpringerLink
Skip to main content

End-to-End Aspect Extraction and Aspect-Based Sentiment Analysis Framework for Low-Resource Languages

  • Conference paper
  • First Online:
Intelligent Systems and Applications (IntelliSys 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 824))

Included in the following conference series:

  • 388 Accesses

Abstract

Due to the increasing volume of user-generated content on the web, the vast majority of businesses and organizations have focused their interest on sentiment analysis in order to gain insights and information about their customers. Sentiment analysis is a Natural Language Processing task that aims to extract information about the human emotional state. Specifically, sentiment analysis can be achieved on three different levels, namely at the document level, sentence level or the aspect/feature level. Since document and sentence levels can be too generic for an opinion estimation given specific attributes of a product or service, aspect-based sentiment analysis became the norm regarding the exploitation of user generated data. However, most human languages, with the exception of the English language, are considered low-resource languages due to the restricted resources available, leading to challenges in automating information extraction tasks. Accordingly, in this work, we propose a methodology for automatic aspect extraction and sentiment classification on Greek texts that can potentially be generalized to other low-resource languages. For the purpose of this study, a new dataset was created consisting of social media posts explicitly written in the Greek language from Twitter, Facebook and YouTube. We further propose Transformer-based Deep Learning architectures that are able to automatically extract the key aspects from texts and then classify them according to the author’s intent into three pre-defined classification categories. The results of the proposed methodology achieved relatively high F1-macro scores on all the classes denoting the importance of the proposed methodology on aspect extraction and sentiment classification on low-resource languages.

This research was carried out as part of the project KMP6-0096055 under the framework of the Action “Investment Plans of Innovation” of the Operational Program “Central Macedonia 2014–2020”, that is co-funded by the European Regional Development Fund and Greece.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 26311
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 32889
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://el.wikipedia.org/wiki/.

  2. 2.

    https://www.statmt.org/europarl/.

  3. 3.

    https://oscar-corpus.com/.

  4. 4.

    https://commoncrawl.org/.

  5. 5.

    https://huggingface.co/docs/transformers/model_doc/bert#transformers.BertTokenizer.

  6. 6.

    Available upon publication.

References

  1. Alexandridis, G., Varlamis, I., Korovesis, K., Caridakis, G., Tsantilas, P.: A survey on sentiment analysis and opinion mining in Greek social media. Information 12(8), 331 (2021)

    Article  Google Scholar 

  2. Athanasiou, V., Maragoudakis, M.: A novel, gradient boosting framework for sentiment analysis in languages where nlp resources are not plentiful: a case study for modern greek. Algorithms 10(1), 34 (2017)

    Article  MathSciNet  Google Scholar 

  3. Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., Grave, E., Ott, M., Zettlemoyer, L. and Stoyanov, V.: Unsupervised cross-lingual representation learning at scale (2019). arXiv:1911.02116

  4. Dai, J., Yan, H., Sun, T., Liu, P., Qiu, X.: Does syntax matter? a strong baseline for aspect-based sentiment analysis with roberta (2021). arXiv:2104.04986

  5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding (2018). arXiv:1810.04805

  6. Honnibal, M., Montani, I., Van Landeghem, S., Boyd, A.: spacy: Industrial-strength natural language processing in python (2020). https://doi.org/10.5281/zenodo.1212303

  7. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the tenth ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, pp. 168–177 (2004)

    Google Scholar 

  8. Karimi, A., Rossi, L., Prati, A.: Adversarial training for aspect-based sentiment analysis with Bert. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 8797–8803. IEEE (2021)

    Google Scholar 

  9. Kaur, J., Kaur Sidhu, B.: Sentiment analysis based on deep learning approaches. In: 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), pp. 1496–1500 IEEE (2018)

    Google Scholar 

  10. Korovesis, K., Alexandridis, G., Caridakis, G., Polydoras, P., Tsantilas, P.: Leveraging aspect-based sentiment prediction with textual features and document metadata. In: 11th Hellenic Conference on Artificial Intelligence, pp. 168–174 (2020)

    Google Scholar 

  11. Koutsikakis, J., Chalkidis, I., Malakasiotis, P., Androutsopoulos, I.: Greek-Bert: The Greeks visiting sesame street. In: 11th Hellenic Conference on Artificial Intelligence, pp. 110–117 (2020)

    Google Scholar 

  12. Kvålseth, T.O: Note on cohen’s kappa. Psychol. Rep. 65(1), 223–226 (1989)

    Google Scholar 

  13. Li, X., Xingyu, F., Guangluan, X., Yang, Y., Wang, J., Jin, L., Liu, Q., Xiang, T.: Enhancing Bert representation with context-aware embedding for aspect-based sentiment analysis. IEEE Access 8, 46868–46876 (2020)

    Article  Google Scholar 

  14. Liapakis, A.: A sentiment lexicon-based analysis for food and beverage industry reviews. the Greek language paradigm. The Greek Language Paradigm (2020). Accessed from 20 May 2020

    Google Scholar 

  15. Magueresse, A., Carles, V., Heetderks, E.: Low-resource languages: A review of past work and future challenges (2020). arXiv:2006.07264

  16. Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting of the Association for Computational Linguistics, pp. 311–318 (2002)

    Google Scholar 

  17. Pavlopoulos, I.: Aspect based sentiment analysis. Athens University of Economics and Business (2014)

    Google Scholar 

  18. Pontiki, M., Galanis, D., Papageorgiou, H., Manandhar, S., Androutsopoulos, I. Semeval-2015 task 12: aspect based sentiment analysis. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 486–495 (2015)

    Google Scholar 

  19. Sang, E.F., Veenstra, J.: Representing text chunks (1999). cs/907006

    Google Scholar 

  20. Solakidis, G.S., Vavliakis, K.N., Mitkas, P.A.: Multilingual sentiment analysis using emoticons and keywords. In: 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), vol. 2, pp. 102–109. IEEE (2014)

    Google Scholar 

  21. Tiedemann, J.: Parallel data, tools and interfaces in opus. In: Lrec, vol. 2012, pp. 2214–2218. Citeseer (2012)

    Google Scholar 

  22. Tiedemann, J., Thottingal, S.: OPUS-MT - Building open translation services for the World. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (EAMT), Lisbon, Portugal (2020)

    Google Scholar 

  23. Wenzek, G., Lachaux, M.A., Conneau, A., Chaudhary, V., Guzman, F., Joulin, A., Grave, E.: Ccnet: Extracting high quality monolingual datasets from web crawl data (2019). arXiv:1911.00359

  24. Xu, H., Liu, B., Shu, L., Yu, P.S.: Bert post-training for review reading comprehension and aspect-based sentiment analysis (2019). arXiv:1904.02232

  25. Yang, Yu., Duan, W., Cao, Q.: The impact of social and conventional media on firm equity value: a sentiment analysis approach. Decis. Supp. Syst. 55(4), 919–926 (2013)

    Article  Google Scholar 

  26. Zhao, A., Yu, Yu.: Knowledge-enabled Bert for aspect-based sentiment analysis. Knowl.-Based Syst. 227, 107220 (2021)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dimitrios Zaikis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Aivatoglou, G., Fytili, A., Arampatzis, G., Zaikis, D., Stylianou, N., Vlahavas, I. (2024). End-to-End Aspect Extraction and Aspect-Based Sentiment Analysis Framework for Low-Resource Languages. In: Arai, K. (eds) Intelligent Systems and Applications. IntelliSys 2023. Lecture Notes in Networks and Systems, vol 824. Springer, Cham. https://doi.org/10.1007/978-3-031-47715-7_56

Download citation

Publish with us

Policies and ethics