Abstract
With the surge of social media, micro-blog has become a popular information share tool, in which retweeting is a basic way to share and spread information. It is important to predict the retweet number for influence measure and precision market. Contemporary methods usually consider it as a classification or regression problem directly, which can be regarded as one-phase models. However, they cannot accurately predict the number of retweet. In this paper, we propose a two-phase model to predict how many times a tweet can be retweeted in Sina Weibo. That is, the model first classifies tweets into several categories, and then does regression on each category. Extensive experiments on real Sina Weibo dataset show that our model is a general framework to achieve better performances than traditional one-phase prediction model without complex feature extraction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Twitter, http://en.wikipedia.org/wiki/Twitter
Sina Weibo, http://en.wikipedia.org/wiki/Sina_Weibo
Petrovic, S., Osborne, M., Lavrenko, V.: RT to Win! Predicting Message Propagation in Twitter. In: ICWSM (2011)
Ma, H., Qian, W., Xia, F., et al.: Towards modeling popularity of microblogs. J. Frontiers of Computer Science 7(2), 171–184 (2013)
Yang, Z., Guo, J., Cai, K., Tang, J., Li, J., Zhang, L., Su, Z.: Understanding retweeting behaviors in social networks. In: Proc. of CIKM, pp. 1633–1636 (2010)
Yang, J., Counts, S.: Predicting the speed, scale, and range of information diffusion in twitter. In: ICWSM (2010)
Peng, H., Zhu, J., Piao, D., Yan, R., Zhang, Y.: Retweet Modeling Using Conditional Random Fields. In: ICDM Workshops, pp. 336–343 (2011)
Zhang, Y., Lu, R., Yang, Q.: Predicting Retweeting in Microblogs. Journal of Chinese Information Processing 26(4), 109–114 (2012)
Bandari, R., Asur, S., Huberman, B.: The pulse of news in social media: forecasting popularity. In: ICWSM (2012)
Li, Y., Yu, H., Liu, L.: Predict algorithm of micro-blog retweet scale based on SVM. Application Research of Computers 30(9), 2594–2597 (2013)
Java, A., Song, X., Finin, T., et al.: Why we twitter: understanding microblogging usage and communities. In: WebKDD, pp. 56–65 (2007)
Boyd, D., Golder, S., Lotan, G.: Tweet, tweet, retweet: Conversational aspects of retweeting on Twitter. In: 43rd Hawaii International Conf. on System Sciences, pp. 1–10 (2010)
Suh, B., Hong, L., Pirolli, P., Chi, E.H.: Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network. In: SocialCom/PASSAT, pp. 177–184 (2010)
Rousseeuw, P.J., Leroy, A.M.: Robust regression and outlier detection (1987)
Wang, Y., Witten, I.H.: Induction of model trees for predicting continuous classes. In: ECML, pp. 128–137 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, G., Shi, C., Chen, Q., Wu, B., Qi, J. (2014). A Two-Phase Model for Retweet Number Prediction. In: Li, F., Li, G., Hwang, Sw., Yao, B., Zhang, Z. (eds) Web-Age Information Management. WAIM 2014. Lecture Notes in Computer Science, vol 8485. Springer, Cham. https://doi.org/10.1007/978-3-319-08010-9_84
Download citation
DOI: https://doi.org/10.1007/978-3-319-08010-9_84
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08009-3
Online ISBN: 978-3-319-08010-9
eBook Packages: Computer ScienceComputer Science (R0)