Post-processing Network Embedding Algorithm with Random Projection and Principal Component Analysis

Computer Science ›› 2021, Vol. 48 ›› Issue (5): 124-129. doi: 10.11896/jsjkx.200500058

• Database & Big Data & Data Science •

HU Xin-tong, SHA Chao-feng, LIU Yan-jun   

  1. School of Computer Science, Fudan University, Shanghai 200433, China
  • Received:2020-05-14 Revised:2020-07-06 Online:2021-05-15 Published:2021-05-09
  • About author: HU Xin-tong, born in 1996, postgraduate. Her main research interests include natural language processing and software engineering.
    SHA Chao-feng, born in 1976, Ph.D., associate professor. His main research interests include machine learning, data mining, and natural language processing.
  • Supported by:
    National Key Research and Development Program of China (2018YFB0904503).

Abstract: Network embedding, a form of network representation learning, has received considerable attention from researchers in recent years. A number of models that learn low-dimensional vector representations of nodes from the network structure, such as DeepWalk, have been developed and achieve good results in tasks such as node classification and link prediction. However, as network size increases, many network embedding algorithms run into computational bottlenecks. To mitigate this problem, learning-free methods such as random projection can be used, but critical information about the network structure may be lost, degrading performance. This paper proposes a post-processing algorithm for network embedding (PPNE), which uses random projection together with principal component analysis to retain key information effectively and to maintain a higher-order approximation of the network structure. Experiments on three public datasets cover node classification and link prediction, and the performance of PPNE is compared against other network embedding algorithms. The experimental results show that PPNE improves substantially on the other algorithms in both task performance and running time, and that it is at least two orders of magnitude faster than the learning-based algorithms while maintaining good task performance.
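
The abstract does not spell out the PPNE procedure, so the following is only a minimal sketch of the general idea it names: a random projection of higher-order proximity followed by a principal component analysis step. The function names, the row-normalised transition matrix, the unweighted sum over orders, and the toy graph are all assumptions made for illustration, not the authors' method.

```python
import numpy as np


def random_projection_embedding(adj, dim, order=2, seed=0):
    """Hypothetical helper: project multi-hop proximity with a Gaussian matrix."""
    rng = np.random.default_rng(seed)
    n = adj.shape[0]
    # Assumption: use the row-normalised adjacency (random-walk transition matrix).
    degree = adj.sum(axis=1, keepdims=True)
    trans = adj / np.maximum(degree, 1.0)
    # Gaussian random projection matrix; no parameters are learned.
    proj = rng.normal(0.0, 1.0 / np.sqrt(dim), size=(n, dim))
    emb = np.zeros((n, dim))
    cur = proj
    for _ in range(order):
        cur = trans @ cur  # one more hop of proximity, already projected
        emb += cur         # assumption: equal weight for every order
    return emb


def pca_postprocess(emb, out_dim):
    """Hypothetical helper: centre the embedding and keep the top principal components."""
    centered = emb - emb.mean(axis=0, keepdims=True)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:out_dim].T


# Toy graph (assumption): two triangles joined by a single edge.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = np.zeros((6, 6))
for u, v in edges:
    adj[u, v] = adj[v, u] = 1.0

raw_embedding = random_projection_embedding(adj, dim=8, order=3)
final_embedding = pca_postprocess(raw_embedding, out_dim=4)
```

Multiplying the projection matrix by the transition matrix repeatedly avoids ever forming a dense power of the adjacency matrix, which is where the speed of projection-based methods generally comes from; the PCA step here is a plain SVD of the centred embedding.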

Key words: Link prediction, Network embedding, Node classification, Principal component analysis, Random projection

CLC Number: 

  • TP391
