Abstract
The popularization of social media exposes the structure of people’s conversation - what kind of people speak with whom, on what topics and with what kinds of words. In this paper, we propose a new approach to mining conversational network by community analysis, which exploits users’ profile information, interaction network and linguistic usage. Using our framework, we conducted empirical analysis on the complex relation among people’s profile information, social network, and language network using a large dataset from Twitter, which covers more than 7M people. Our findings include (1) we can extract a community composed of people who use the same kinds of slangs by exploiting information from both the social network and word usage, (2) when we focus on similarity among communities in terms of both interaction and word usage, we can find specific patterns based on the people’s profile information including their attributes and interests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. Journal of Machine Learning Research 3, 1137–1155 (2003)
Blondel, V., Guillaume, J., Lambiotte, R., Mech, E.: Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 10008–10019 (2008)
Gunnemann, S., Farber, I., Boden, B., Seidl, T.: Subspace clustering meets dense subgraph mining: A synthesis of two paradigms. In: Proceeeindgs of the 10th IEEE International Conference on Data Mining, pp. 845–850. IEEE (2010)
Lim, K.H., Datta, A.: Tweets beget propinquity: Detecting highly interactive communities on twitter using tweeting links. In: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, WI-IAT 2012, vol. 01, pp. 214–221. IEEE Computer Society (2012)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. NIPS 2013 (2013)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York (1986)
Tang, J., Wang, X., Liu, H.: Integrating social media data for community detection. In: Atzmueller, M., Chin, A., Helic, D., Hotho, A. (eds.) MSM/MUSE 2011. LNCS (LNAI), vol. 7472, pp. 1–20. Springer, Heidelberg (2012)
Zhou, Y., Cheng, H., Yu, J.X.: Graph clustering based on structural/attribute similarities. Proceedings of the VLDB Endowment 2(1), 718–729 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Marui, J., Nori, N., Sakaki, T., Mori, J. (2014). Empirical Study of Conversational Community Using Linguistic Expression and Profile Information. In: Ślȩzak, D., Schaefer, G., Vuong, S.T., Kim, YS. (eds) Active Media Technology. AMT 2014. Lecture Notes in Computer Science, vol 8610. Springer, Cham. https://doi.org/10.1007/978-3-319-09912-5_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-09912-5_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09911-8
Online ISBN: 978-3-319-09912-5
eBook Packages: Computer ScienceComputer Science (R0)