SparkHINlog: Extension of SparkDatalog for heterogeneous information network

被引:1
|
作者
Do Phuc [1 ]
机构
[1] Univ Informat Technol, VNU HCM, Ho Chi Minh City, Vietnam
关键词
Bibliographic network; datalog rules; heterogeneous information networks; meta-path; spark graphframes;
D O I
10.3233/JIFS-179362
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real world data is often interconnected, forming large and complex heterogeneous information networks (HINs) with multiple types of objects and links such as bibliographic network (DBLP) and knowledge bases (YaGo). Querying metapaths requires exploration of path instances which can be computational cost in large HINs. However, existing meta-path based studies mostly focus on analytical applications of meta-paths, rather than systems to query meta-paths efficiently in large HINs. To bridge this gap, in this work we present SparkHINlog, a system based on Apache Spark, to handle meta-paths queries efficiently on large scale HINs. In SparkHINlog we propose an algorithm to not only translate meta-paths to Datalog rules, but also to manage the working memory area of Datalog efficiently to increase the scalability of SparkHlNlog. To avoid the computing overhead of join operation to discover path instances when evaluating these rules, we leverage Motif Finding, a powerful tool of GraphFrames Library. With motif finding, SparkHlNLog can speed up the time to evaluate the rules by path finding on graph instead on joining two relations. We conduct experimental comparisons with SparkDatalog, the state-of-the-art large-scale Datalog system, and verify the efficacy and effectiveness of our system in supporting meta-path queries.
引用
收藏
页码:7555 / 7566
页数:12
相关论文
共 50 条
  • [1] Incorporating Temporal Information for Recommendation in Heterogeneous Information Network
    Ling, Yanxiang
    Liang, Zheng
    Yang, Wenjing
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 103 - 107
  • [2] A Survey of Heterogeneous Information Network Analysis
    Shi, Chuan
    Li, Yitong
    Zhang, Jiawei
    Sun, Yizhou
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (01) : 17 - 37
  • [3] HINE: Heterogeneous Information Network Embedding
    Chen, Yuxin
    Wang, Chenguang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT I, 2017, 10177 : 180 - 195
  • [4] Heterogeneous Information Network for Person Recognition
    Kim, Hye-Jin
    Kim, DoHyung
    Oh, Jin-Tae
    2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 723 - 724
  • [5] Temporal Heterogeneous Information Network Embedding
    Huang, Hong
    Shi, Ruize
    Zhou, Wei
    Wang, Xiao
    Jin, Hai
    Fu, Xiaoming
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1470 - 1476
  • [6] Hyperbolic Heterogeneous Information Network Embedding
    Wang, Xiao
    Zhang, Yiding
    Shi, Chuan
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5337 - 5344
  • [7] Heterogeneous Information Network Embedding for Recommendation
    Shi, Chuan
    Hu, Binbin
    Zhao, Wayne Xin
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 357 - 370
  • [8] Ranking on Network of Heterogeneous Information Networks
    Xu, Zhe
    Zhang, Si
    Xia, Yinglong
    Xiong, Liang
    Tong, Hanghang
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 848 - 857
  • [9] Research on the Extension of SCTP Protocol on the Heterogeneous Wireless Network
    Yuan, Yao
    Zhang, Dalin
    Tian, Lin
    Shi, Jinglin
    INTERNATIONAL JOURNAL OF INTERDISCIPLINARY TELECOMMUNICATIONS AND NETWORKING, 2016, 8 (02) : 69 - 87
  • [10] Network Schema Preserving Heterogeneous Information Network Embedding
    Zhao, Jianan
    Wang, Xiao
    Shi, Chuan
    Liu, Zekuan
    Ye, Yanfang
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1366 - 1372