SparkHINlog: Extension of SparkDatalog for heterogeneous information network

被引:1
|
作者
Do Phuc [1 ]
机构
[1] Univ Informat Technol, VNU HCM, Ho Chi Minh City, Vietnam
关键词
Bibliographic network; datalog rules; heterogeneous information networks; meta-path; spark graphframes;
D O I
10.3233/JIFS-179362
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real world data is often interconnected, forming large and complex heterogeneous information networks (HINs) with multiple types of objects and links such as bibliographic network (DBLP) and knowledge bases (YaGo). Querying metapaths requires exploration of path instances which can be computational cost in large HINs. However, existing meta-path based studies mostly focus on analytical applications of meta-paths, rather than systems to query meta-paths efficiently in large HINs. To bridge this gap, in this work we present SparkHINlog, a system based on Apache Spark, to handle meta-paths queries efficiently on large scale HINs. In SparkHINlog we propose an algorithm to not only translate meta-paths to Datalog rules, but also to manage the working memory area of Datalog efficiently to increase the scalability of SparkHlNlog. To avoid the computing overhead of join operation to discover path instances when evaluating these rules, we leverage Motif Finding, a powerful tool of GraphFrames Library. With motif finding, SparkHlNLog can speed up the time to evaluate the rules by path finding on graph instead on joining two relations. We conduct experimental comparisons with SparkDatalog, the state-of-the-art large-scale Datalog system, and verify the efficacy and effectiveness of our system in supporting meta-path queries.
引用
收藏
页码:7555 / 7566
页数:12
相关论文
共 50 条
  • [41] Prediction of ESG compliance using a heterogeneous information network
    Hisano, Ryohei
    Sornette, Didier
    Mizuno, Takayuki
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [42] A Malware Detection System Based on Heterogeneous Information Network
    Yin, Shang-Nan
    Kang, Ho-Seok
    Chen, Zhi-Guo
    Kim, Sung-Ryul
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 154 - 159
  • [43] Author Name Disambiguation Based on Heterogeneous Information Network
    Qiping D.
    Weijing C.
    Ling J.
    Yu’e Z.
    Data Analysis and Knowledge Discovery, 2022, 6 (04) : 60 - 68
  • [44] Recent Developments of Deep Heterogeneous Information Network Analysis
    Shi, Chuan
    Yu, Philip S.
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2973 - 2974
  • [45] Personalized Entity Recommendation: A Heterogeneous Information Network Approach
    Yu, Xiao
    Ren, Xiang
    Sun, Yizhou
    Gu, Quanquan
    Sturt, Bradley
    Khandelwal, Urvashi
    Norick, Brandon
    Han, Jiawei
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 283 - 292
  • [46] Comparative Analysis of Similarity Measures in Heterogeneous Information Network
    Patil, Vaishali
    Vasappanavara, Ramesh
    Ghorpade, Tushar
    PROCEEDINGS OF 2017 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO 2017), 2017, : 297 - 301
  • [47] Type Sequence Preserving Heterogeneous Information Network Embedding
    Chen, Yuxin
    Wang, Tengjiao
    Chen, Wei
    Li, Qiang
    Qiu, Zhen
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9931 - 9932
  • [48] Event Coreference Resolution Based On Heterogeneous Information Network
    Sun, Tao
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2014, 7 (06): : 1 - 10
  • [49] Fast computation of General SimRank on heterogeneous information network
    Zhang, Chuanyan
    Hong, Xiaoguang
    Zheng, Yongqing
    DISCOVER COMPUTING, 2024, 27 (01)
  • [50] Finding Dimensions for Text Based on Heterogeneous Information Network
    Jiang, Fei
    Hong, Xiaoguang
    Peng, Zhaohui
    Li, Qingzhong
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 819 - 823