Abstract
Given the wide spread of social networks, research efforts to retrieve information using tagging from social networks communications have increased. In particular, in Twitter social network, hashtags are widely used to define a shared context for events or topics. While this is a common practice often the hashtags freely introduced by the user become easily biased. In this paper, we propose to deal with this bias defining semantic meta-hashtags by clustering similar messages to improve the classification. First, we use the user-defined hashtags as the Twitter message class labels. Then, we apply the meta-hashtag approach to boost the performance of the message classification.
The meta-hashtag approach is tested in a Twitter-based dataset constructed by requesting public tweets to the Twitter API. The experimental results yielded by comparing a baseline model based on user-defined hashtags with the clustered meta-hashtag approach show that the overall classification is improved. It is concluded that by incorporating semantics in the meta-hashtag model can have impact in different applications, e.g. recommendation systems, event detection or crowdsourcing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Efron, M.: Hashtag retrieval in a microblogging environment. In: SIGIR, pp. 787–788 (2010)
Abel, F., Gao, Q., Houben, G.-J., Tao, K.: Semantic enrichment of twitter posts for user profile construction on the social web. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part II. LNCS, vol. 6644, pp. 375–389. Springer, Heidelberg (2011)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, pp. 851–860 (2010)
Popescu, A.-M., Pennacchiotti, M.: Detecting controversial events from twitter. In: CIKM, pp. 1873–1876 (2010)
Becker, H., Naaman, M., Gravano, L.: Beyond trending topics: Real-world event identification on twitter. In: ICWSM (2011)
Ovadia, S.: Exploring the Potential of Twitter as a Research Tool. Behavioral & Social Sciences Librarian 28(4), 202–205 (2009)
Rowe, M., Stankovic, M.: Mapping tweets to conference talks: A goldmine for semantics. In: Workshop on Social Data on the Web, SDoW (2010)
Weller, K., Puschmann, C.: Twitter for Scientific Communication: How Can Citations/References be Identified and Measured?, pp. 1–4 (2011), http://journal.webscience.org/500/
Abel, F., Gao, Q., Houben, G.-J., Tao, K.: Analyzing user modeling on twitter for personalized news recommendations. In: Proceedings of the 19th International Conference on User Modeling, Adaption, and Personalization, pp. 1–12 (2011)
Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: What 140 characters reveal about political sentiment. In: Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media, pp. 178–185 (2010)
O’Connor, B., Balasubramanyan, R., Routledge, B.R., Smith, N.A.: From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. In: Proceedings of the International AAAI Conference on Weblogs and Social Media (2010)
Huang, J., Thornton, K.M., Efthimiadis, E.N.: Conversational tagging in twitter. In: Proceedings of the 21st ACM Conference on Hypertext and Hypermedia, pp. 173–178 (2010)
Merriam-webster’s dictionary (October 2012), www.merriam-webster.com/
Zappavigna, M.: Ambient affiliation: A linguistic perspective on Twitter. New Media & Society 13(5), 788–806 (2011)
Johnson, S.: How twitter will change the way we live. Time Magazine 173, 23–32 (2009)
Tsur, O., Rappoport, A.: “What’s in a hashtag?: content based prediction of the spread of ideas in microblogging communities”. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, WSDM 2012, pp. 643–652 (2012)
Yang, L., Sun, T., Zhang, M., Mei, Q.: We know what @you #tag: does the dual role affect hashtag adoption? In: Proceedings of the 21st International Conference on World Wide Web, WWW 2012, pp. 261–270 (2012)
Chang, H.-C.: A new perspective on twitter hashtag use: diffusion of innovation theory. In: Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem, ASIS&T 2010, vol. 47, pp. 85:1–85:4 (2010)
Doerr, B., Fouz, M., Friedrich, T.: Why rumors spread so quickly in social networks. Commun. ACM 55(6), 70–75 (2012)
Tantipathananandh, C., Berger-Wolf, T.Y.: Finding Communities in Dynamic Social Networks. In: 2011 IEEE 11th International Conference on Data Mining, pp. 1236–1241 (2011)
Treiber, M., Schall, D., Dustdar, S., Scherling, C.: Tweetflows: flexible workflows with twitter. In: Proceedings of the 3rd International Workshop on Principles of Engineering Service-Oriented Systems, pp. 1–7 (2011)
Zangerle, E., Gassler, W., Specht, G.: Recommending #-tags in twitter. In: Proceedings of the Workshop on Semantic Adaptive Social Web, in Connection with the 19th International Conference on User Modeling, Adaptation and Personalization, UMAP 2011, pp. 67-78 (2011)
Oguztuzun, H., Ozdikis, O., Senkul, P.: Semantic Expansion of Hashtags for Enhanced Event Detection in Twitter. In: VLDB The First International Workshop on Online Social Systems (2012)
Koerich, A.: Improving classification performance using metaclasses. In: IEEE International Conference on Systems, Man and Cybernetics, pp. 717–722 (2003)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer (1999)
Joachims, T.: Learning Text Classifiers with Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2002)
Tong, S., Koller, D.: Support vector machine active learning with applications to text classification. The Journal of Machine Learning Research 2, 45–66 (2002)
Costa, J., Silva, C., Antunes, M., Ribeiro, B.: On using crowdsourcing and active learning to improve classification performance. In: Proceedings of 11th International Conference on Intelligent Systems Design and Applications (ISDA), pp. 469–474 (2011)
van Rijsbergen, C.: Information Retrieval. Butterworths (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Costa, J., Silva, C., Antunes, M., Ribeiro, B. (2013). Defining Semantic Meta-hashtags for Twitter Classification. In: Tomassini, M., Antonioni, A., Daolio, F., Buesser, P. (eds) Adaptive and Natural Computing Algorithms. ICANNGA 2013. Lecture Notes in Computer Science, vol 7824. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37213-1_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-37213-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37212-4
Online ISBN: 978-3-642-37213-1
eBook Packages: Computer ScienceComputer Science (R0)