SparkHINlog: Extension of SparkDatalog for heterogeneous information network

被引:1
|
作者
Do Phuc [1 ]
机构
[1] Univ Informat Technol, VNU HCM, Ho Chi Minh City, Vietnam
关键词
Bibliographic network; datalog rules; heterogeneous information networks; meta-path; spark graphframes;
D O I
10.3233/JIFS-179362
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real world data is often interconnected, forming large and complex heterogeneous information networks (HINs) with multiple types of objects and links such as bibliographic network (DBLP) and knowledge bases (YaGo). Querying metapaths requires exploration of path instances which can be computational cost in large HINs. However, existing meta-path based studies mostly focus on analytical applications of meta-paths, rather than systems to query meta-paths efficiently in large HINs. To bridge this gap, in this work we present SparkHINlog, a system based on Apache Spark, to handle meta-paths queries efficiently on large scale HINs. In SparkHINlog we propose an algorithm to not only translate meta-paths to Datalog rules, but also to manage the working memory area of Datalog efficiently to increase the scalability of SparkHlNlog. To avoid the computing overhead of join operation to discover path instances when evaluating these rules, we leverage Motif Finding, a powerful tool of GraphFrames Library. With motif finding, SparkHlNLog can speed up the time to evaluate the rules by path finding on graph instead on joining two relations. We conduct experimental comparisons with SparkDatalog, the state-of-the-art large-scale Datalog system, and verify the efficacy and effectiveness of our system in supporting meta-path queries.
引用
收藏
页码:7555 / 7566
页数:12
相关论文
共 50 条
  • [21] Embedding Heterogeneous Information Network in Hyperbolic Spaces
    Zhang, Yiding
    Wang, Xiao
    Liu, Nian
    Shi, Chuan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (02)
  • [22] Heterogeneous Information Network Embedding With Adversarial Disentangler
    Wang, Ruijia
    Shi, Chuan
    Zhao, Tianyu
    Wang, Xiao
    Ye, Yanfang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1581 - 1593
  • [23] Universal Network Representation for Heterogeneous Information Networks
    Hu, Ruiqi
    Yu, Celina Ping
    Fung, Sai-Fu
    Pan, Shirui
    Wang, Haishuai
    Long, Guodong
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 388 - 395
  • [24] AHINE: Adaptive Heterogeneous Information Network Embedding
    Lin, Yucheng
    Hong, Huiting
    Yang, Xiaoqing
    Gong, Pinghua
    Li, Zang
    Ye, Jieping
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 100 - 107
  • [25] Heterogeneous Information Network Embedding for Mention Recommendation
    Yi, Feng
    Jiang, Bo
    Wu, Jianjun
    IEEE ACCESS, 2020, 8 : 91394 - 91404
  • [26] Heterogeneous Information Network Embedding for Mention Recommendation
    Yi, Feng
    Jiang, Bo
    Wu, Jianjun
    IEEE Access, 2020, 8 : 91394 - 91404
  • [27] Sequential Recommendation on Dynamic Heterogeneous Information Network
    Xie, Tao
    Xu, Yangjun
    Chen, Liang
    Liu, Yang
    Zheng, Zibin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2105 - 2110
  • [28] HeteSpaceyWalk: A Heterogeneous Spacey Random Walk for Heterogeneous Information Network Embedding
    He, Yu
    Song, Yangqiu
    Li, Jianxin
    Ji, Cheng
    Peng, Jian
    Peng, Hao
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 639 - 648
  • [29] Extension of Analog Network Coding in Wireless Information Exchange
    Chen, Cheng
    Huang, Jiaqing
    FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): MACHINE VISION, IMAGE PROCESSING, AND PATTERN ANALYSIS, 2012, 8349
  • [30] An Extension of Radio Network Information Interfaces for Connectivity Management
    Pencheva, Evelina
    2018 21ST CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS AND WORKSHOPS (ICIN), 2018,