Abstract
In this paper we test various weighted local similarity network measures for predicting the future content of tweets. Our aim is to determine the most suitable measure for predicting new content in tweets and subsequently explore the spreading positively and negatively oriented content on Twitter. The tweets in the English language were collected via the Twitter API depending on their content. That is, we searched for the tweets containing specific predefined keywords from different domains - positive or negative. From the gathered tweets the weighted complex network of words is formed, where nodes represent words and a link between two nodes exists if these two words co-occur in the same tweet, while the weight denotes the co-occurrence frequency. For the link prediction task we study five local similarity network measures commonly used in unweighted networks (Common Neighbors, Jaccard Coefficient, Preferential Attachment, Adamic Adar and Resource Allocation Index) which we have adapted to weighted networks. Finally, we evaluated all the modified measures in terms of the precision of predicted links. The obtained results suggest that the Weighted Resource Allocation Index has the best potential for the prediction of content in tweets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Stopwords are a list of the most common, short function words that do not carry strong semantic properties, but are needed for the syntax of a language (pronouns, prepositions, conjunctions, abbreviations, ...).
References
Twitter - Wikipedia, the free encyclopedia. 20-Feb-2016. https://en.wikipedia.org/wiki/Twitter. Accessed 21 Feb 2016
Huberman, B.A., Romero, D.M., Wu, F.: Social networks that matter: Twitter under the microscope. arXiv:0812.1045 [physics] (2008)
Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in twitter: the million follower fallacy. In: ICWSM, vol. 10, pp. 10–17 (2010)
Bollen, J., Mao, H., Zeng, X.J.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)
Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: what 140 characters reveal about political sentiment. In: Fourth International AAAI Conference on Weblogs and Social Media (2010)
Boyd, D., Golder, S., Lotan, G.: Tweet, tweet, retweet: conversational aspects of retweeting on twitter. In: 2010 43rd Hawaii International Conference on System Sciences (HICSS), pp. 1–10 (2010)
Mathioudakis, M., Koudas, N.: TwitterMonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 1155–1158 (2010)
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project. Report 1, 12 (2009)
Lu, L., Zhou, T.: Role of Weak Ties in Link Prediction of Complex Networks. arXiv:0907.1728 [cs] (2009)
De Sá, H.R., Prudêncio, R.B.: Supervised link prediction in weighted networks. In: The 2011 International Joint Conference on Neural Networks (IJCNN), pp. 2281–2288 (2011)
Yang, Y., Lichtenwalter, R.N., Chawla, N.V.: Evaluating link prediction methods. Knowl. Inf. Syst. 45(3), 751–782 (2015)
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: LREC, pp. 2200–2204 (2010)
GET search/tweets — Twitter Developers. https://dev.twitter.com/rest/reference/get/search/tweets. Accessed 21 Feb 2016
Schult, D.A., Swart, P.: Exploring network structure, dynamics, and function using NetworkX. In: Proceedings of the 7th Python in Science Conferences (SciPy 2008), pp. 11–16 (2008)
Margan, D., Meštrović, A.: LaNCoA: a python toolkit for language networks construction and analysis. In: International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp. 1628–1633 (2015)
Yuan, G., Murukannaiah, P.K., Zhang, Z., Singh, M.P.: Exploiting sentiment homophily for link prediction. In: Proceedings of the 8th ACM Conference on Recommender Systems, pp. 17–24 (2014)
Hong, L., Dan, O., Davison, B.D.: Predicting popular messages in twitter. In: Proceedings of the 20th International Conference Companion on World Wide Web, pp. 57–58
Shi, L., Agarwal, N., Agrawal, A., Garg, R., Spoelstra, J.: Predicting US primary elections with Twitter (2012)
Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: what 140 characters reveal about political sentiment. In: ICWSM, vol. 10, pp. 178–185
Liben-Nowel, D., Kleinberg, J.: The link prediction problem for social networks. In: Proceedings of the CKIM, pp. 556–559 (2003)
Burnap, P., Gibson, R., Sloan, L., Southern, R., Williams, M.: 140 characters to victory?: Using Twitter to predict the UK 2015 General Election. arXiv:1505.01511 [physics] (2015)
Singla, P., Richardson, M.: Yes, there is a correlation: from social networks to personal behavior on the Web. In: Proceedings of the 17th International Conference on World Wide Web, pp. 655–664 (2008)
Valverde-Rebaza, J., de Andrade Lopes, A.: Exploiting behaviors of communities of twitter users for link prediction. Soc. Netw. Anal. Mining 3(4), 1063–1074 (2013)
Bliss, C.A., Frank, M.R., Danforth, C.M., Dodds, P.S.: An evolutionary algorithm approach to link prediction in dynamic social networks. J. Comput. Sci. 5(5), 750–764 (2011)
Lu, L., Zhou, T.: Link prediction in complex networks: a survey. Phys. A 390(6), 1150–1170 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Martinčić-Ipšić, S., Močibob, E., Meštrović, A. (2016). Link Prediction on Tweets’ Content. In: Dregvaite, G., Damasevicius, R. (eds) Information and Software Technologies. ICIST 2016. Communications in Computer and Information Science, vol 639. Springer, Cham. https://doi.org/10.1007/978-3-319-46254-7_45
Download citation
DOI: https://doi.org/10.1007/978-3-319-46254-7_45
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46253-0
Online ISBN: 978-3-319-46254-7
eBook Packages: Computer ScienceComputer Science (R0)