Assigning Polarity Scores to Reviews Using Machine Learning Techniques

Okanohara, Daisuke; Tsujii, Jun’ichi

doi:10.1007/11562214_28

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3651)

Included in the following conference series:

International Conference on Natural Language Processing

Abstract

We propose a novel type of document classification task that quantifies how much a given document (review) appreciates the target object using not binary polarity (good or bad) but a continuous measure called sentiment polarity score (sp-score). An sp-score gives a very concise summary of a review and provides more information than binary classification. The difficulty of this task lies in the quantification of polarity. In this paper we use support vector regression (SVR) to tackle the problem. Experiments on book reviews with five-point scales show that SVR outperforms a multi-class classification method using support vector machines and the results are close to human performance.
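
The regression view described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy reviews, the bag-of-words features, the linear model trained by subgradient descent, the clipping to the 1-5 scale, and every function name (`featurize`, `train_linear_svr`, `sp_score`) are assumptions made for illustration only. The one element taken from standard SVR is the epsilon-insensitive loss, which ignores errors smaller than `epsilon`.

```python
from collections import Counter

# Hypothetical toy reviews with five-point ratings (invented for illustration).
REVIEWS = [
    ("awful boring plot", 1.0),
    ("boring and dull", 2.0),
    ("average read", 3.0),
    ("good story", 4.0),
    ("great wonderful story", 5.0),
]

VOCAB = sorted({word for text, _ in REVIEWS for word in text.split()})


def featurize(text):
    """Bag-of-words count vector over VOCAB (unknown words are ignored)."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in VOCAB]


def epsilon_insensitive_loss(prediction, target, epsilon=0.1):
    """SVR loss: errors smaller than epsilon cost nothing."""
    return max(0.0, abs(prediction - target) - epsilon)


def predict(weights, bias, features):
    """Linear model output w.x + b."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def train_linear_svr(X, y, epsilon=0.1, C=1.0, lr=0.01, epochs=2000):
    """Minimise 0.5*||w||^2 + C*sum(eps-insensitive loss) by batch subgradient descent."""
    weights = [0.0] * len(X[0])
    bias = 0.0
    for _ in range(epochs):
        grad_w = list(weights)  # gradient of the regulariser 0.5*||w||^2
        grad_b = 0.0
        for features, target in zip(X, y):
            error = predict(weights, bias, features) - target
            if abs(error) > epsilon:  # only points outside the tube contribute
                sign = 1.0 if error > 0 else -1.0
                for j, x in enumerate(features):
                    grad_w[j] += C * sign * x
                grad_b += C * sign
        weights = [w - lr * g for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b
    return weights, bias


def sp_score(text, weights, bias, low=1.0, high=5.0):
    """Continuous score clipped to the rating scale (clipping is an assumption)."""
    return min(high, max(low, predict(weights, bias, featurize(text))))


X = [featurize(text) for text, _ in REVIEWS]
y = [score for _, score in REVIEWS]
WEIGHTS, BIAS = train_linear_svr(X, y)

print(sp_score("great story", WEIGHTS, BIAS))
print(sp_score("boring plot", WEIGHTS, BIAS))
```

Unlike a multi-class classifier, which treats the five ratings as unrelated labels, this kind of model is penalised in proportion to how far its output lies from the true rating, so a prediction of 4 for a 5-star review costs less than a prediction of 1.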

Author information

Authors and Affiliations

Department of Computer Science, University of Tokyo, Hongo, 7-3-1, Bunkyo-ku, Tokyo, 113-0013
Daisuke Okanohara & Jun’ichi Tsujii
CREST, JST, Honcho, 4-1-8, Kawaguchi-shi, Saitama, 332-0012
Jun’ichi Tsujii
School of Informatics, University of Manchester, POBox 88, Sackville St, Manchester, M60 1QD, UK
Jun’ichi Tsujii

Editor information

Editors and Affiliations

Center for Language Technology, Macquarie University, 2019, Sydney, NSW, Australia
Robert Dale
Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Kam-Fai Wong
Institute for Infocomm Research, 21, Heng Mui Keng Terrace, 119613, Singapore
Jian Su
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong

About this paper

Cite this paper

Okanohara, D., Tsujii, J. (2005). Assigning Polarity Scores to Reviews Using Machine Learning Techniques. In: Dale, R., Wong, KF., Su, J., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2005. IJCNLP 2005. Lecture Notes in Computer Science, vol 3651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562214_28

DOI: https://doi.org/10.1007/11562214_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29172-5
Online ISBN: 978-3-540-31724-1
eBook Packages: Computer Science, Computer Science (R0)
