Abstract
Heterogeneous link prediction aims to reveal potential connections between two nodes in heterogeneous information networks. Most existing studies are based on meta-paths, but ignore the information contained in incomplete meta-paths. They simply aggregate meta-paths, leading to mining semantic information insufficiently. To solve this problem, we propose a link prediction model based on enhanced meta-path aggregation and attention mechanism. In this model, the deficiency of missing topological information from incomplete meta-paths is compensated by aggregating structural features and semantics. Different from existing meta-path encoders, we use recurrent neural networks and the attention mechanism to learn explicit and implicit semantic knowledge from meta-paths, which can capture more complex semantic associations between nodes. In addition, to avoid duplicate feature acquisition by random walking, we design a novel bidirectional biased random walking algorithm. It is applied to guide the generation of heterogeneous neighbors of each node that contain features ignored by the meta-path-wise model, which can mine complete topological information and get more accurate link prediction results. The extensive experiments on several datasets demonstrate that the proposed model outperforms baselines.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Some or all data, models, or codes generated or used during the study are available from the corresponding author by request. They are also available in a repository or online.
References
Hao P, Jianxin LI, Yangqiu S, Renyu Y, Ranjan R, Yu PS, Lifang HE (2021) Streaming social event detection and evolution discovery in heterogeneous information networks. ACM Trans Knowl Discov Data 15(5):1–33
Alrabea A, Alzubi O, Alzubi J (2020) An enhanced mac protocol design o prolong sensor network lifetime. Int J Commun Antenna Propag (IRECAP) 10:37
Babu MV, Alzubi JA, Sekaran R, Patan R, Ramachandran M, Gupta D (2021) An improved IDAF-FIT clustering based ASLPP-RR routing with secure data aggregation in wireless sensor network. Mob Netw Appl 26(3):1059–1067
Pu C, Li J, Wang J, Quek TQS (2022) The node-similarity distribution of complex networks and its applications in link prediction. IEEE Trans Knowl Data Eng 34(8):4011–4023
Ma G, Yan H, Qian Y, Wang L, Dang C, Zhao Z (2021) Path-based estimation for link prediction. Int J Mach Learn Cybern 12(9):2443–2458
Yuan W, Han Y, Guan D, Han G, Tian Y, Al-Dhelaan A, Al-Dhelaan M (2022) Weighted enclosing subgraph-based link prediction for complex network. EURASIP J Wirel Commun Netw 2022(1):1–14
Li C, Wei W, Feng X, Liu J (2021) Research of motif-based similarity for link prediction problem. IEEE Access 9:66636–66645
Förster Y-P, Gamberi L, Tzanis E, Vivo P, Annibale A (2022) Exact and approximate mean first passage times on trees and other necklace structures: a local equilibrium approach. J Phys: Math Theor 55(11):1–33
Shan N, Li L, Zhang Y, Bai S, Chen X (2020) Supervised link prediction in multiplex networks. Knowledge-Based Syst. https://doi.org/10.1016/j.knosys.2020.106168
Zhu Z, Yan M, Deng X, Gao M (2022) Rating prediction of recommended item based on review deep learning and rating probability matrix factorization. Electron Commer Res Appl. https://doi.org/10.1016/j.elerap.2022.101160
Gul H, Amin A, Adnan A, Huang K (2021) A systematic analysis of link prediction in complex network. IEEE Access 9:20531–20541
Kumar S, Panda BS, Aggarwal D (2021) Community detection in complex networks using network embedding and gravitational search algorithm. J Intell Inf Syst 57(1):51–72
Wang Z, Ye X, Wang C, Cui J, Yu PS (2021) Network embedding with completely-imbalanced labels. IEEE Trans Knowl Data Eng 33(11):3634–3647
Zou J, Du Z, Zhao S (2022) Multi-granular attributed network representation learning. Int J Mach Learn Cybern 13(7):2071–2087
Li H, Wang Y, Lyu Z, Shi J (2022) Multi-task learning for recommendation over heterogeneous information network. IEEE Trans Knowl Data Eng 34(2):789–802
Raveendran AP, Alzubi JA, Sekaran R, Ramachandran M (2021) A high performance scalable fuzzy based modified asymmetric heterogene multiprocessor system on chip (AHt-MPSOC) reconfigurable architecture. J Intell Fuzzy Syst 42:1–12
Ruiz L, Gama F, Ribeiro A (2021) Graph neural networks: architectures, stability, and transferability. Proc IEEE 109(5):660–682
Yang F, Zhang H, Tao S (2022) Hybrid deep graph convolutional networks. Int J Mach Learn Cybern 13(8):2239–2255
Liang F, Qian C, Yu W, Griffith D, Golmie N (2022) Survey of graph neural networks and applications. Wirel Commun Mob Comput. https://doi.org/10.1155/2022/9261537
Shengsheng Q, Jun HU, Quan F, Changsheng XU (2021) Knowledge-aware multi-modal adaptive graph convolutional networks for fake news detection. ACM Trans Multimed Comput Commun Appl 17(3):1–23
Zhang T, Shan H-R, Little MA (2022) Causal GraphSAGE: a robust graph method for classification based on causal sampling. Pattern Recognit 128:108696
Mo X, Huang Z, Xing Y, Lv C (2022) Multi-agent trajectory prediction with heterogeneous edge-enhanced graph attention network. IEEE Trans Intell Transport Syst 23(7):9554–9567
Fu X, Zhang J, Meng Z, King I (2020) MAGNN: metapath aggregated graph neural network for heterogeneous graph embedding. Proceedings of The Web Conference 2020. Association for Computing Machinery: Taipei, Taiwan, pp 2331–2341
Zhang D, Yin J, Zhu X, Zhang C (2018) MetaGraph2Vec: complex semantic path augmented heterogeneous network embedding. Springer International Publishing, Cham, pp 196–208
Zhang C, Shang K-K, Qiao J (2021) Adaptive similarity function with structural features of network embedding for missing link prediction. Complexity. https://doi.org/10.1155/2021/1277579
Zhou L-H, Wang J-L, Wang L-Z, Chen H-M, Kong B (2022) Heterogeneous information network representation learning: a survey. Jisuanji Xuebao/Chinese J Comput 45(1):160–189
Chen J, Huang F, Peng J (2021) MSGCN: multi-subgraph based heterogeneous graph convolution network embedding. Appl Sci (2076-3417) 11(21):9832
Zheng S, Guan D, Yuan W (2022) Semantic-aware heterogeneous information network embedding with incompatible meta-paths. World Wide Web 25(1):1
Banan A, Nasiri A, Taheri-Garavand A (2020) Deep learning-based appearance features extraction for automated carp species identification. Aquac Eng 89:102053
Tang Z, Wang H, Yi X, Zhang Y, Kwong S, Kuo CJ (2023) Joint graph attention and asymmetric convolutional neural network for deep image compression. IEEE Trans Circuits Syst Video Technol 33(1):421–433
Wang J, Zhao C, He S, Gu Y, Alfarraj O, Abugabah A (2022) LogUAD: log unsupervised anomaly detection based on word2Vec. Comput Syst Sci Eng 41(3):1207–1222
Mei J, Wang Y, Tu X, Dong M, He T (2023) Incorporating BERT with probability-aware gate for spoken language understanding. IEEE/ACM Trans Audio Speech Lang Process 31:826–834
Li RQ, Zhao X, Moens MF (2023) A brief overview of universal sentence representation methods: a linguistic view. Acm Comput Surv 55(3):1
Weibin C, Danial S, Guoxi L, Shahab SB, Kwok Wing C, Amir M (2022) Accurate discharge coefficient prediction of streamlined weirs by coupling linear regression and deep convolutional gated recurrent unit. Eng Appl Comput Fluid Mech 16(1):965–976
Wang W-C, Du Y-J, Chau K-W, Xu D-M, Liu C-J, Ma Q (2021) An ensemble hybrid forecasting model for annual runoff based on sample entropy, secondary decomposition, and long short-term memory neural network. Water Resour Manag 35(14):4695–4726
Alzubi JA, Jain R, Kathuria A, Khandelwal A, Saxena A, Singh A (2020) Paraphrase identification using collaborative adversarial networks. J Intell Fuzzy Syst 39(1):1021–1032
Wei H, Zhou A, Zhang Y, Chen F, Qu W, Lu M (2022) Biomedical event trigger extraction based on multi-layer residual BiLSTM and contextualized word representations. Int J Mach Learn Cybern 13(3):721–733
Fan Y, Xu K, Wu H, Zheng Y, Tao B (2020) Spatiotemporal modeling for nonlinear distributed thermal processes based on KL decomposition, MLP and LSTM network. IEEE Access 8:25111–25121
Chengcheng C, Qian Z, Mahsa HK, Changhyun J, Sayed MB, Shahab SB, Sonam Sandeep D, Kwok-Wing C (2022) Forecast of rainfall distribution based on fixed sliding window long short-term memory. Eng Appl Comput Fluid Mech 16(1):248–261
Wang J, Li H, Liang L, Zhou Y (2022) Community discovery algorithm of complex network attention model. Int J Mach Learn Cybern 13(6):1619–1631
Kazemi B, Abhari A (2020) Content-based Node2Vec for representation of papers in the scientific literature. Data Knowl Eng 127:101794
Baptista A, Gonzalez A, Baudot A (2022) Universal multilayer network exploration by random walk with restart. Commun Phys 5(1):1–9
Jiang J-Y, Li Z, Ju CJT, Wang W (2020) In MARU: Meta-context Aware Random Walks for Heterogeneous Network Representation Learning, 29th ACM International Conference on Information and Knowledge Management, CIKM 2020, October 19, 2020 - October 23, 2020, Virtual, Online, Ireland, Association for Computing Machinery: Virtual, Online, Ireland, pp. 575–584.
Xu L, He Z, Wang K, Wang C, Huang S (2022) Explicit message-passing heterogeneous graph neural network. IEEE Trans Knowl Data Eng 99:1–13
Haitham AA, Ahmedbahaaaldin IAO, Yusuf E, Ali NA, Yuk FH, Ozgur K, Mohsen S, Ahmed S, Kwok-wing C, Ahmed E-S (2021) Modeling the fluctuations of groundwater level by employing ensemble deep learning techniques. Eng Appl Comput Fluid Mech 15(1):1420–1439
Zeng K, Liu J, Jiang Z, Xu D (2022) A decreasing scaling transition scheme from Adam to SGD. Adv Theory Simul 5(7):1–10
Cai L, Li J, Wang J, Ji S (2022) Line graph neural networks for link prediction. IEEE Trans Pattern Anal Mach Intell 44(9):5103–5113
Singh SS, Mishra S, Kumar A, Biswas B (2022) Link prediction on social networks based on centrality measures. Springer Sci Bus Media Deutschland GmbH 246:71–89
Funding
National Science Foundation of China, 61602491, Lunwen Wang.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shao, H., Wang, L. & Zhu, R. Link prediction for heterogeneous information networks based on enhanced meta-path aggregation and attention mechanism. Int. J. Mach. Learn. & Cyber. 14, 3087–3103 (2023). https://doi.org/10.1007/s13042-023-01822-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-023-01822-9