Abstract
Community detection is to partition a network into several components, each of which contains densely connected nodes with some structural similarities. Recently, multiplex networks, each layer consisting of a same node set but with a different topology by a unique edge type, have been proposed to model real-world multi-relational networks. Although some heuristic algorithms have been extended into multiplex networks, little work on neural models have been done so far. In this paper, we propose a graph convolutional fusion model (GCFM) for community detection in multiplex networks, which takes account of both intra-layer structural and inter-layer relational information for learning node representation in an interwoven fashion. In particular, we first develop a graph convolutional auto-encoder for each network layer to encode neighbor-aware intra-layer structural features under different convolution scales. We next design a multiscale fusion network to learn a holistic version of nodes’ representations by fusing nodes’ encodings at different layers and different scales. Finally, a self-training mechanism is used to train our model and output community divisions. Experiment results on both synthetic and real-world datasets indicate that the proposed GCFM outperforms the state-of-the-art techniques in terms of better detection performances.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Availability of data and materials
All data generated or analysed during this study are included in this published article.
Notes
In order to distinguish the concept of "network layer", we use "scale" in the context of graph convolutional networks.
References
Ali HT, Liu S, Yilmaz Y, Couillet R, Rajapakse I, Hero A (2019) Latent heterogeneous multilayer community detection. In: Proceedings of the international conference on acoustics, speech and signal processing, pp 8142–8146. IEEE
Berlingerio M, Coscia M, Giannotti F (2011) Finding and characterizing communities in multidimensional networks. In: Proceedings of international conference on advances in social networks analysis and mining, pp 490–494. IEEE
Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech: Theory Exp 2008(10):10008
Bouguessa M, Wang S, Dumoulin B (2010) Discovering knowledge-sharing communities in question-answering forums. ACM Trans Knowl Discov Data 5(1):1–49
Boutemine O, Bouguessa M (2017) Mining community structures in multidimensional networks. ACM Trans Knowl Discov Data 11(4):1–36
Bo D, Wang X, Shi C, Zhu M, Lu E, Cui P (2020) Structural deep clustering network. In: Proceedings of the World Wide Web conference, pp 1400–1410
Bródka P (2016) A method for group extraction and analysis in multilayer social networks. arXiv preprint arXiv:1612.02377
Cao J, Jin D, Yang L, Dang J (2018) Incorporating network structure with node contents for community detection on large networks using deep learning. Neurocomputing 297:71–81
Chang H, Feng Z, Ren Z (2016) Community detection using dual-representation chemical reaction optimization. IEEE Transn Cybern 47(12):4328–4341
Chen BL, Hall DH, Chklovskii DB (2006) Wiring optimization can relate neuronal structure and function. Proc Natl Acad Sci 103(12):4723–4728
Chen Z, Chen C, Zheng Z, Zhu Y (2019) Tensor decomposition for multilayer networks clustering. In: Proceedings of the AAAI conference on artificial intelligence, pp 3371–3378
Chen H, Perozzi B, Hu Y, Skiena S (2018) Harp: Hierarchical representation learning for networks. In: Proceedings of the AAAI conference on artificial intelligence, vol 32
Coleman J, Katz E, Menzel H (1957) The diffusion of an innovation among physicians. Sociometry 20(4):253–270
De Domenico M, Solé-Ribalta A, Gómez S, Arenas A (2014) Navigability of interconnected networks under random failures. Proc Natl Acad Sci 111(23):8351–8356
Eagle N, Pentland AS (2006) Reality mining: sensing complex social systems. Pers Ubiquit Comput 10(4):255–268
Fortunato S (2010) Community detection in graphs. Phys Rep 486(3–5):75–174
Gao X, Zheng Q, Verri FA, Rodrigues RD, Zhao L (2019) Particle competition for multilayer network community detection. In: Proceedings of the 11th international conference on machine learning and computing, pp 75–80
Garcia JO, Ashourvan A, Muldoon S, Vettel JM, Bassett DS (2018) Applications of community detection techniques to brain graphs: algorithmic considerations and implications for neural function. Proc IEEE 106(5):846–867
Gligorijević V, Panagakis Y, Zafeiriou S (2019) Non-negative matrix factorizations for multiplex network analysis. IEEE Trans Pattern Anal Mach Intell 41(4):928–940
Grover A, Leskovec J (2016) node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 855–864
Huang X, Chen D, Ren T, Wang D (2021) A survey of community detection methods in multilayer networks. Data Min Knowl Disc 35(1):1–45
Interdonato R, Tagarelli A, Ienco D, Sallaberry A, Poncelet P (2017) Local community detection in multilayer networks. Data Min Knowl Disc 31(5):1444–1479
Jia Y, Zhang Q, Zhang W, Wang X (2019) Communitygan: Community detection with generative adversarial nets. In: Proceedings of the World Wide Web conference, pp 784–794
Jing B, Park C, Tong H (2021) Hdmi: High-order deep multiplex infomax. In: Proceedings of the web conference 2021, pp 2414–2424
Jin D, Liu Z, Li W, He D, Zhang W (2019) Graph convolutional networks meet markov random fields: Semi-supervised community detection in attribute networks. In: Proceedings of the AAAI conference on artificial intelligence, pp 152–159
Kuang D, Ding C, Park H (2012) Symmetric nonnegative matrix factorization for graph clustering. In: Proceedings of the international conference on data mining, pp 106–117. SIAM
Liu Q, Wang B (2022) Neural extraction of multiscale essential structure for network dismantling. Neural Netw 154:99–108
Liu Q, Wang B, Qi J, Deng X (2022) A new centrality measure based on neighbor loop structure for network dismantling. Digit Commun Netw
Liu F, Xue S, Wu J, Zhou C, Hu W, Paris C, Nepal S, Yang J, Yu PS (2020) Deep learning for community detection: Progress, challenges and opportunities. In: Proceedings of the 29th international joint conference on artificial intelligence, pp. 4981–4987. International Joint Conferences on Artificial Intelligence Organization
Ma X, Dong D, Wang Q (2018) Community detection in multi-layer networks using joint nonnegative matrix factorization. IEEE Trans Knowl Data Eng 31(2):273–286
Magalingam P, Davis S, Rao A (2015) Using shortest path to discover criminal community. Digit Investig 15:1–17
Magnani M, Hanteer O, Interdonato R, Rossi L, Tagarelli A (2021) Community detection in multiplex networks. ACM Comput Surv 54(3):1–35
Magnani M, Micenkova B, Rossi L (2013) Combinatorial analysis of multiple networks. arXiv preprint arXiv:1303.4986
Magnani M, Rossi L (2011) The ml-model for multi-layer social networks. In: ASONAM, pp 5–12. IEEE Computer Society
Mercorio F, Mezzanzanica M, Moscato V, Picariello A, Sperli G (2019) Dico: A graph-db framework for community detection on big scholarly data. IEEE Trans Emerg Top Comput 9(4):1987–2003
Mucha PJ, Richardson T, Macon K, Porter MA, Onnela J-P (2010) Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980):876–878
Newman ME, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026113
Park C, Kim D, Han J, Yu H (2020) Unsupervised attributed multiplex network embedding. Proc AAAI Conf Artif Intell 34:5371–5378
Paul S, Chen Y (2022) Null models and community detection in multi-layer networks. Sankhya A 84(1):163–217
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710
Pramanik S, Tackx R, Navelkar A, Guillaume J-L, Mitra B (2017) Discovering community structure in multilayer networks. In: 2017 IEEE international conference on data science and advanced analytics (DSAA), pp 611–620. IEEE
Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3):036106
Shao Z, Ma L, Lin Q, Li J, Gong M, Nandi AK (2022) Pmcdm: privacy-preserving multiresolution community detection in multiplex networks. Knowl-Based Syst 244:108542
Song H, Thiagarajan JJ (2019) Improved deep embeddings for inferencing with multi-layered graphs. In: Proceedings of the international conference on big data, pp 5394–5400. IEEE
Souravlas S, Anastasiadou S, Katsavounis S (2021) A survey on the recent advances of deep community detection. Appl Sci 11(16):7179
Sperlí G (2019) A deep learning based community detection approach. In: Proceedings of the 34th ACM/SIGAPP symposium on applied computing, pp 1107–1110
Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M (2006) Biogrid: a general repository for interaction datasets. Nucleic Acids Res 34(suppl_1), 535–539
Suthers D, Fusco J, Schank P, Chu K-H, Schlager M (2013) Discovery of community structures in a heterogeneous professional online network. In: 2013 46th Hawaii international conference on system sciences, pp 3262–3271. IEEE
Su X, Xue S, Liu F, Wu J, Yang J, Zhou C, Hu W, Paris C, Nepal S, Jin D, et al (2022) A comprehensive survey on community detection with deep learning. IEEE Trans Neural Netw Learn Syst
Tagarelli A, Amelio A, Gullo F (2017) Ensemble-based community detection in multilayer networks. Data Min Knowl Disc 31(5):1506–1543
Tang L, Wang X, Liu H (2009) Uncoverning groups via heterogeneous interaction analysis. In: Proceedings of the 9th international conference on data mining, pp 503–512. IEEE
Tu C, Zeng X, Wang H, Zhang Z, Liu Z, Sun M, Zhang B, Lin L (2018) A unified framework for community detection and network representation learning. IEEE Trans Knowl Data Eng 31(6):1051–1065
Van der Maaten L, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9(11):2579–2605
Velickovic P, Fedus W, Hamilton WL, Liò P, Bengio Y, Hjelm RD (2019) Deep graph infomax. ICLR (Poster) 2(3):4
Wang C, Pan S, Hu R, Long G, Jiang J, Zhang C (2019) Attributed graph clustering: a deep attentional embedding approach. In: Proceedings of the 28th international joint conference on artificial intelligence, pp 3670–367. International Joint Conferences on Artificial Intelligence Organization
Xia L, Huang C, Xu Y, Dai P, Zhang X, Yang H, Pei J, Bo L (2021) Knowledge-enhanced hierarchical graph transformer network for multi-behavior recommendation. Proc AAAI Conf Artif Intell 35:4486–4493
Xie Y, Gong M, Wang S, Yu B (2018) Community discovery in networks with deep sparse filtering. Pattern Recogn 81:50–59
Xie J, Girshick R, Farhadi A (2016) Unsupervised deep embedding for clustering analysis. In: Proceedings of the international conference on machine learning, pp 478–487. PMLR
Ye F, Chen C, Zheng Z (2018) Deep autoencoder-like nonnegative matrix factorization for community detection. In: Proceedings of the 27th ACM international conference on information and knowledge management, pp 1393–1402
Zhang H, Wang C-D, Lai J-H, Philip SY (2017) Modularity in complex multilayer networks with multiple aspects: a static perspective. In: Proceedings of the applied informatics, pp 1–29. SpringerOpen
Funding
This work is supported in part by National Natural Science Foundation of China (Grant No: 62172167).
Author information
Authors and Affiliations
Contributions
XC: methodology, software, experimentation, result analysis, writing—original draft, visualization. BW: Conceptualization, methodology, result analysis, writing—review editing, supervision.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethics approval
Not applicable.
Consent to participate
Yes.
Consent for publication
Yes.
Additional information
Responsible editor: Jingrui He.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A
Appendix A
In this Appendix, we discuss the characteristics of multilayer modularity and the encoding distance with a node corresponding to the same entity in different layers.
Theorem 1
Given the modularity of each single layer \(Q_{s}=\sum \limits _{ij} [(\textbf{A}_{ijs} - \gamma _{s} \frac{d_{is} d_{js}}{2m_{s}})] \delta \left( g_{i}, \right. \left. g_{j}\right) \), the multilayer modularity will degenerate into the sum of multiple single layer modularity, i.e. \(\sum \limits _{s} Q_{s}\), if a node is assigned with only one community label in its all layers of a multiplex network.
For the multilayer modularity defined by Eq.(21), it allows that a node in each layer can have its own label; While GCFM and some other baseline methods (PMM, MDLPA, CSNMF, DMGI, HDMI) assign a community label to a node in its belonging layers. In order to analyze the GCFM, we can rewrite Eq.(21) as:
For layer s, r and node i, j, the Eq.(A1) can be divided into four cases:
\(\bullet \) Case1: \(s=r\) and \(i=j\) (\(\delta \left( g_{i}, g_{j}\right) =1\) and \(C_{jsr}=0\)).
\(\bullet \) Case2: \(s=r\) and \(i \ne j\).
\(\bullet \) Case3: \(s \ne r\) and \(i = j\) (\(\delta \left( g_{i}, g_{j}\right) =1\)).
\(\bullet \) Case4: \(s \ne r\) and \(i \ne j\).
Therefore, \(Q_{m}\) can be represented as \(Q^{(1)}+Q^{(2)}+Q^{(3)}+Q^{(4)}\). \(Q^{(3)}\) and \(Q^{(4)}\) are constants and \(C_{jsr}\) is proven ineffective in our problem setting. It works on the case that each node in each layer is assigned with an independent community label. \(Q^{(1)}\) and \(Q^((2))\) can be merged as:
which means the sum of each single layer modularity \(Q_{s}=\sum \limits _{ij} [(\textbf{A}_{ijs} - \gamma _{s} \frac{d_{is} d_{js}}{2m_{s}})] \delta \left( g_{i}, g_{j}\right) \). To sum up, our model is to design an optimization function \(g(\cdot )\) to maximize the sum of monoplex modularity:
Theorem 2
The GCA can decrease the encoding distance of same node in different layers, i.e. \(\Vert \hat{\textbf{h}}_{i,1}-\hat{\textbf{h}}_{i,2}\Vert _{2}\), if they have similar topological structures in their own layers.
Given a multiplex network, we focus on the k-th GCA scale. Denote \(\textbf{h}_{i,l}\) as the encoding of node i of the l-th layer (denoted as (i,l)) in \((k-1)\)-th scale and we have \(\hat{\textbf{h}}_{i,l}=ReLU(\sum _{j \in N_{i,l}} \frac{\textbf{h}_{j,l}}{\sqrt{d_{i,l}}\sqrt{d_{j,l}}}\textbf{W})\), where \(N_{i,l}\) is the neighborhoods of node (i,l). Assume \(ReLU(\cdot )\) as \(\sigma (x)=x\) and \(\textbf{W}=\textbf{I}\). \(\hat{\textbf{h}}_{i,l}\) can be regarded as the average of neighbor encodings. Now we focus on node i in a two-layer multiplex network, i.e. \(\hat{\textbf{h}}_{i,1}\) and \(\hat{\textbf{h}}_{i,2}\). For \(\hat{\textbf{h}}_{i,1}\), it can be divided into three parts: (a) Self node encoding \(\frac{\textbf{h}_{i,l}}{d_{i,l}}\). (b) The encodings of generalized common neighbors of (i,1) and (i,2), where generalized common neighbors mean that nodes correspond to the same entity. It can be represented as \(\frac{\textbf{B}_{1}}{\sqrt{d_{i,1}}}\), where \(\textbf{B}_{1}=\sum _{u \in N_{i,1} \cap N_{i,2}} \frac{\textbf{h}_{u,1}}{\sqrt{d_{u,1}}}\) is the sum of common neighbor encodings. (c) The other non-common neighbor encodings, denoted as \(\frac{\textbf{D}_{1}}{\sqrt{d_{i,1}}}\), where \(\textbf{D}_{1}=\sum _{v \in N_{i,1} - N_{i,1} \cap N_{i,2}} \frac{\textbf{h}_{v,1}}{\sqrt{d_{v,1}}}\). Assume each node in the two layer has the same initial encoding in \((k-1)\)-th scale, i.e. \(\textbf{h}_{i,1}=\textbf{h}_{i,2}\). Then we measure the distance of \(\hat{\textbf{h}}_{i,1}\) and \(\hat{\textbf{h}}_{i,2}\):
From Eq.(A8) we can derive the upper bound of the distance between \(\hat{\textbf{h}}_{i,1}\) and \(\hat{\textbf{h}}_{i,2}\). The first term measures the degree difference of (i,1) and (i,2), where \(\vert \frac{d_{i,2}-d_{i,1}}{d_{i,1}d_{i,2}}\vert \le 1\). If \(d_{i,1} \approx d_{i,2}\), the effect of the first term can be ignored; While if the two nodes have no similar degrees such as \(d_{i,2} \gg d_{i,1} = 1\), it may achieve the upper bound 1. The second term represents the influence of the common neighbors. For each item \(\vert \frac{\sqrt{d_{i,2}d_{u,2}}-\sqrt{d_{i,1}d_{u,1}}}{\sqrt{d_{i,1}d_{u,1}d_{i,2}d_{u,2}}}\vert \le 1\), we assume (i,1) and (i,2) have similar degrees. Specially, if \(d_{u,2} \gg d_{u,1}\), this item will become \(\vert \frac{1}{\sqrt{d_{i,1}d_{u,1}}} \vert \) which is controlled by the degree of node (i,1) and the degree of its common neighbor (u,1), both normally larger than 1. So usually \(\vert \frac{\sqrt{d_{i,2}d_{u,2}}-\sqrt{d_{i,1}d_{u,1}}}{\sqrt{d_{i,1}d_{u,1}d_{i,2}d_{u,2}}}\vert < 1\). The third term shows the influence of the non-common neighbors. For the upper bound of each item \(\vert \frac{1}{\sqrt{d_{i,1}d_{v,1}}} \vert \) and \(\vert \frac{1}{\sqrt{d_{i,2}d_{v,1}}} \vert \), if \(d_{v,1} \gg d_{i,1}\) and \(d_{v,2} \gg d_{i,2}\), the effect of the third term can be ignored. Besides, there is certain restrictive correlation between the second term and the third term, which is \(d_{i,1}=\vert u \in N_{i,1} \cap N_{i,2} \vert + \vert v \in N_{i,1} - N_{i,1} \cap N_{i,2} \vert \).
In a multiplex network, we hope that the same node in different layers have approximate performance if they have similar topology so that we can fuse their encodings and obtain a consistent community label. Equation(A8) proves that our GCA can decrease the encoding distance of same node in different layers with similar topology, i.e node degree, common neighbor and one-hop neighbor degree, which will be benefit for the task of community detection.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Cai, X., Wang, B. A graph convolutional fusion model for community detection in multiplex networks. Data Min Knowl Disc 37, 1518–1547 (2023). https://doi.org/10.1007/s10618-023-00932-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10618-023-00932-w