Dynamic Heterogeneous Network Embedding Based on Non-Decreasing Temporal Random Walk

被引:0
|
作者
Guo J. [1 ,2 ]
Bai Q. [1 ,2 ]
Lin Z. [1 ]
Song C. [1 ,2 ]
Yuan X. [1 ,2 ]
机构
[1] College of Cyber Science, Nankai University, Tianjin
[2] Tianjin Key Laboratory of Network and Data Security Technology(Nankai University), Tianjin
来源
Song, Chunyao (chunyao.song@dbis.nankai.edu.cn) | 1624年 / Science Press卷 / 58期
基金
中国国家自然科学基金;
关键词
Dynamic network; Heterogeneous information network; Incremental learning; Network embedding; Random walk;
D O I
10.7544/issn1000-1239.2021.20210317
中图分类号
学科分类号
摘要
Network embedding is an important work as a representation learning method for mapping high-dimensional networks to low-dimensional vector spaces. Some researches have been carried out on dynamic homogeneous network embedding and static network embedding. But there are still fewer studies for embedding on dynamic heterogeneous information networks (DHINs). If we directly apply static network embedding methods or dynamic homogeneous network embedding methods to solve the DHIN embedding problem, it will lead to serious information loss due to ignoring the dynamic or heterogeneous properties of the network. Therefore, we propose a DHIN embedding method called TNDE, which is based on time- and category-constrained random walk. The method adopts category constraints to solve the problem of preserving semantic information in DHINs. Moreover, unlike the temporal random walk in other dynamic network embedding methods, TNDE uses non-decreasing temporal constraints to incrementally perform random walk. It can solve the problem that edges on local structures with strong semantics have the same timestamps due to the simultaneous existence of dynamic and heterogeneous properties in DHIN and avoid being trapped in the same timestamps during walking. TNDE provides an efficient online representation learning algorithm by adopting incremental walking and incremental representation learning for real-time changes. Experimental results on three real datasets show that TNDE has good generality in networks with different characteristics and significantly improves embedding quality, which outperforms state-of-the-art methods by 2.4%~92.7% in downstream link prediction and node classification tasks. Moreover, TNDE reduces the algorithm runtime by 12.5%~99.91% with good embedding quality. © 2021, Science Press. All right reserved.
引用
收藏
页码:1624 / 1641
页数:17
相关论文
共 42 条
  • [1] Cui Peng, Wang Xiao, Pei Jian, Et al., A survey on network embedding, IEEE Transactions on Knowledge and Data Engineering, 31, 5, pp. 833-852, (2019)
  • [2] Dong Yuxiao, Chawla N V, Swami A., Metapath2Vec: Scalable representation learning for heterogeneous networks, Proc of the 23rd ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining, pp. 135-144, (2017)
  • [3] Lee J B, Nguyen G, Rossi R A, Et al., Dynamic node embeddings from edge streams, IEEE Transactions on Emerging Topics in Computational Intelligence, (2020)
  • [4] Tang Jie, Zhang Jing, Yao Limin, Et al., Arnetminer: Extraction and mining of academic social networks, Proc of the 14th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining, pp. 990-998, (2008)
  • [5] Sun Yizhou, Han Jiawei, Yan Xifeng, Et al., Pathsim: Meta path-based top-k similarity search in heterogeneous information networks, Proceedings of the VLDB Endowment, 4, 11, pp. 992-1003, (2011)
  • [6] Chen Haochen, Perozzi B, AI-Rfou R, Et al., A tutorial on network embeddings, (2018)
  • [7] Zhang Daokun, Yin Jie, Zhu Xingquan, Et al., Network representation learning: A survey, IEEE Transactions on Big Data, 6, 1, pp. 3-28, (2020)
  • [8] Hamilton W L, Ying R, Leskovec J., Representation learning on graphs: Methods and applications, (2018)
  • [9] Shi Chuan, Li Yitong, Zhang Jiawei, Et al., A survey of heterogeneous information network analysis, IEEE Transactions on Knowledge and Data Engineering, 29, 1, pp. 17-37, (2017)
  • [10] Yang C, Xiao Yuxin, Zhang Yu, Et al., Heterogeneous network representation learning: Survey, benchmark, evaluation, and beyond[J], (2020)