SparkHINlog: Extension of SparkDatalog for heterogeneous information network

被引:1
|
作者
Do Phuc [1 ]
机构
[1] Univ Informat Technol, VNU HCM, Ho Chi Minh City, Vietnam
关键词
Bibliographic network; datalog rules; heterogeneous information networks; meta-path; spark graphframes;
D O I
10.3233/JIFS-179362
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real world data is often interconnected, forming large and complex heterogeneous information networks (HINs) with multiple types of objects and links such as bibliographic network (DBLP) and knowledge bases (YaGo). Querying metapaths requires exploration of path instances which can be computational cost in large HINs. However, existing meta-path based studies mostly focus on analytical applications of meta-paths, rather than systems to query meta-paths efficiently in large HINs. To bridge this gap, in this work we present SparkHINlog, a system based on Apache Spark, to handle meta-paths queries efficiently on large scale HINs. In SparkHINlog we propose an algorithm to not only translate meta-paths to Datalog rules, but also to manage the working memory area of Datalog efficiently to increase the scalability of SparkHlNlog. To avoid the computing overhead of join operation to discover path instances when evaluating these rules, we leverage Motif Finding, a powerful tool of GraphFrames Library. With motif finding, SparkHlNLog can speed up the time to evaluate the rules by path finding on graph instead on joining two relations. We conduct experimental comparisons with SparkDatalog, the state-of-the-art large-scale Datalog system, and verify the efficacy and effectiveness of our system in supporting meta-path queries.
引用
收藏
页码:7555 / 7566
页数:12
相关论文
共 50 条
  • [31] Side Information Fusion for Recommender Systems over Heterogeneous Information Network
    Zhao, Huan
    Yao, Quanming
    Song, Yangqiu
    Kwok, James T.
    Lee, Dik Lun
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (04)
  • [32] Learning heterogeneous information network embeddings via relational triplet network
    Gao, Xiyue
    Chen, Jun
    Zhan, Zexing
    Yang, Shuai
    NEUROCOMPUTING, 2020, 412 : 31 - 41
  • [33] Leveraging heterogeneous information based on heterogeneous network and homophily theory for community recommendations
    Han Chen
    Weiwei Deng
    Electronic Commerce Research, 2023, 23 : 2463 - 2483
  • [34] HeteEdgeWalk: A Heterogeneous Edge Memory Random Walk for Heterogeneous Information Network Embedding
    Liu, Zhenpeng
    Zhang, Shengcong
    Zhang, Jialiang
    Jiang, Mingxiao
    Liu, Yi
    ENTROPY, 2023, 25 (07)
  • [35] The information propagation mechanism of individual heterogeneous adoption behavior under the heterogeneous network
    Cui, Shiru
    Zhu, Xuzhen
    FRONTIERS IN PHYSICS, 2024, 12
  • [36] Leveraging heterogeneous information based on heterogeneous network and homophily theory for community recommendations
    Chen, Han
    Deng, Weiwei
    ELECTRONIC COMMERCE RESEARCH, 2023, 23 (04) : 2463 - 2483
  • [37] A fuzzy extension of VIKOR for target network selection in heterogeneous wireless environments
    Mehbodniya, Abolfazl
    Kaleem, Faisal
    Yen, Kang K.
    Adachi, Fumiyuki
    PHYSICAL COMMUNICATION, 2013, 7 : 145 - 155
  • [38] Heterogeneous Evolution Network Embedding with Temporal Extension for Intelligent Tutoring Systems
    Liu, Sannyuya
    Liu, Shengyingjie
    Yang, Zongkai
    Sun, Jianwen
    Shen, Xiaoxuan
    Li, Qing
    Zou, Rui
    Du, Shangheng
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)
  • [39] On MIB Detection under Cell Range Extension in LTE Heterogeneous Network
    Jiang, Zheng
    Yang, Shan
    Chen, Peng
    Yang, Fengyi
    Bi, Qi
    2013 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2013), 2013,
  • [40] An effective heterogeneous information network representation learning framework
    Han, Zhongming
    Jin, Xuelian
    Xing, Haozhen
    Yang, Weijie
    Xiong, Haitao
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 66 - 78