Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce

被引:0
|
作者
Husain, Mohammad Farhan [1 ]
Doshi, Pankil [1 ]
Khan, Latifur [1 ]
Thuraisingham, Bhavani [1 ]
机构
[1] Univ Texas Dallas, Dallas, TX 75080 USA
来源
CLOUD COMPUTING, PROCEEDINGS | 2009年 / 5931卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Handling huge amount of data scalably is a matter of concern for a long time. Same is true for semantic web data. Current semantic web frameworks lack this ability. In this paper, we describe a framework that we built using Hadoop(1) to store and retrieve large number of RDF(2) triples. We describe our schema to store RDF data in Hadoop Distribute File System. We also present our algorithms to answer a SPARQL(3) query. We make use of Hadoop's MapReduce framework to actually answer the queries. Our results reveal that we can store huge amount of semantic web data in Hadoop clusters built mostly by cheap commodity class hardware and still can answer queries fast enough. We conclude that ours is a scalable framework, able to handle large amount of RDF data efficiently.
引用
收藏
页码:680 / 686
页数:7
相关论文
共 50 条
  • [21] Enumerating Maximal Bicliques from a Large Graph using MapReduce
    Mukherjee, Arko Provo
    Tirthapura, Srikanta
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 707 - 714
  • [22] Enumerating Maximal Bicliques from a Large Graph Using MapReduce
    Mukherjee, Arko Provo
    Tirthapura, Srikanta
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2017, 10 (05) : 771 - 784
  • [23] Review of large-scale RDF data processing in mapreduce
    Hou, Ke
    Zhang, Ming
    Fang, Xing
    Journal of Software Engineering, 2015, 9 (01): : 195 - 202
  • [24] Data Analysis using Hadoop MapReduce Environment
    Merla, PrathyushaRani
    Liang, Yiheng
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4783 - 4785
  • [25] Clustering on Big Data Using Hadoop MapReduce
    Akthar, Nadeem
    Ahamad, Mohd Vasim
    Khan, Shahbaz
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 789 - 795
  • [26] Smart RDF Data storage in Graph Databases
    De Virgilio, Roberto
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 872 - 881
  • [27] Fast execution of RDF queries using Apache Hadoop
    Mazumdar, Somnath
    Scionti, Alberto
    ADVANCES IN COMPUTERS, VOL 119, 2020, 119 : 1 - 33
  • [28] Optimizing the Hadoop MapReduce Framework with high-performance storage devices
    Moon, Sangwhan
    Lee, Jaehwan
    Sun, Xiling
    Kee, Yang-suk
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (09): : 3525 - 3548
  • [29] Optimizing the Hadoop MapReduce Framework with high-performance storage devices
    Sangwhan Moon
    Jaehwan Lee
    Xiling Sun
    Yang-suk Kee
    The Journal of Supercomputing, 2015, 71 : 3525 - 3548
  • [30] Nesting Strategies for Enabling Nimble MapReduce Dataflows for Large RDF Data
    Ravindra, Padmashree
    Anyanwu, Kemafor
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2014, 10 (01) : 1 - 26