Abstract
Heterogeneous graph neural networks (HGNNs) can effectively model the multiple node types and complex interactions found in real networks, and they have been applied to a variety of practical problems. Self-supervised learning-based HGNNs (SL-HGNNs) have become a research focus in this field because they reduce the dependence on labels, which are difficult to acquire in practical scenarios. These methods usually split a heterogeneous graph into multiple subgraphs based on meta-paths and study each subgraph separately. However, they ignore the complex interactions between the different semantics of the graph. In addition, they treat node features only as auxiliary information for heterogeneous graph representation learning and thus overlook the importance of the features themselves. To solve these problems, we propose a Self-supervised Learning on Heterogeneous Graph Neural Network via Semantic strength and Feature similarity (HetGNN-SF) model. The model implements a Feature- and Topology-based Comparative optimization (FTC) method to generate weights for the different meta-paths and then splits the original heterogeneous graph into semantic subgraphs based on those meta-paths. Thereafter, the subgraphs are fused using the FTC weights to generate semantic fusion graphs that capture the interactions between different semantics. Node embeddings are then generated from the semantic fusion graphs from two perspectives: semantic strength and feature similarity. Finally, the positive and negative samples produced by FTC are used for contrastive learning between the two perspectives to yield the final node embeddings. Extensive experiments are conducted on three real-world datasets; the results show that HetGNN-SF outperforms state-of-the-art models. Our data and code are available on GitHub (https://github.com/LiuXMaa/HetGNN-SF-.git).
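As a rough illustration of two steps named in the abstract (weighted fusion of meta-path subgraphs and contrastive learning between two views), the following minimal sketch may help. It is not the authors' FTC method or the released code: the function names, the meta-path labels PAP and PSP, the hand-picked weights, the temperature `tau`, and the InfoNCE-style loss are all assumptions made for illustration only.

```python
import math


def fuse_subgraphs(adjs, weights):
    """Fuse meta-path adjacency matrices by a normalized weighted sum.

    adjs: list of n x n adjacency matrices (lists of lists), one per meta-path.
    weights: one non-negative weight per meta-path (assumed given here; the
    paper derives them with FTC, which this sketch does not reproduce).
    """
    n = len(adjs[0])
    total = sum(weights)
    fused = [[0.0] * n for _ in range(n)]
    for adj, w in zip(adjs, weights):
        for i in range(n):
            for j in range(n):
                fused[i][j] += (w / total) * adj[i][j]
    return fused


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def contrastive_loss(z_a, z_b, positives, tau=0.5):
    """InfoNCE-style loss between two views of the same nodes.

    z_a, z_b: per-node embeddings from the two views.
    positives[i]: non-empty set of node indices in view B that count as
    positives for node i in view A; all other nodes act as negatives.
    """
    n = len(z_a)
    loss = 0.0
    for i in range(n):
        sims = [math.exp(cosine(z_a[i], z_b[j]) / tau) for j in range(n)]
        pos = sum(sims[j] for j in positives[i])
        loss += -math.log(pos / sum(sims))
    return loss / n


if __name__ == "__main__":
    # Two hypothetical meta-path subgraphs over three nodes.
    adj_pap = [[1, 1, 0], [1, 1, 0], [0, 0, 1]]
    adj_psp = [[1, 0, 1], [0, 1, 0], [1, 0, 1]]
    fused = fuse_subgraphs([adj_pap, adj_psp], [0.75, 0.25])
    print(fused[0])

    # Toy embeddings for two views; each node is its own positive.
    z_a = [[1.0, 0.0], [0.0, 1.0]]
    z_b = [[1.0, 0.0], [0.0, 1.0]]
    print(contrastive_loss(z_a, z_b, [{0}, {1}]))
```

In this sketch the fused edge weight between two nodes grows with the weight of every meta-path that connects them, so an edge supported by a strongly weighted meta-path dominates one supported only by a weakly weighted meta-path. The loss decreases as each node's embedding in one view moves closer to its positives in the other view.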









Data Availability
The datasets used in the experiments are publicly available in the online repository.
Acknowledgements
This research is supported by the National Natural Science Foundation of China (Grant Nos. 62072288 and 61702306), the Taishan Scholar Program of Shandong Province, and the Natural Science Foundation of Shandong Province (Grant No. ZR2022MF268).
Funding
This study was funded by the National Natural Science Foundation of China (Grant Nos. 62072288 and 61702306), the Taishan Scholar Program of Shandong Province, and the Natural Science Foundation of Shandong Province (Grant No. ZR2022MF268).
Author information
Contributions
Chao Li, Xinming Liu, and Zhongying Zhao wrote the main manuscript text; Xinming Liu and Yeyu Yan prepared the experimental results; all authors reviewed the manuscript.
Ethics declarations
Consent to participate
All authors consented to participate.
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, C., Liu, X., Yan, Y. et al. HetGNN-SF: Self-supervised learning on heterogeneous graph neural network via semantic strength and feature similarity. Appl Intell 53, 21902–21919 (2023). https://doi.org/10.1007/s10489-023-04612-6
DOI: https://doi.org/10.1007/s10489-023-04612-6