Efficient Distributed k-Clique Mining for Large Networks Using MapReduce

被引:0
|
作者
Shahrivari, Saeed [1 ]
Jalili, Saeed [1 ]
机构
[1] Tarbiat Modares Univ, Tehran 14115111, Iran
关键词
Cloud computing; Data mining; Memory management; Task analysis; Social networking (online); Bioinformatics; Multicore processing; k-clique mining; MapReduce algorithms; parallel graph; COMMUNITY STRUCTURE;
D O I
10.1109/TKDE.2019.2936027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining cliques of a network is an important problem that has many applications in different fields like social networks, bioinformatics, and web analysis. In most applications, mining fixed sized cliques, known as k-cliques, is enough. However, mining cliques of a large network is very challenging using current solutions, and it takes a considerable time using a commodity machine. Also, very large networks cannot be efficiently loaded into memory of a single machine. To overcome these limitations, we have proposed a solution named KCminer, which is based on state space search and can be totally fitted into the MapReduce framework. Using the MapReduce framework, it is possible to run KCminer on cloud computing platforms and hence, process very large networks in feasible time. Our experiments which were performed on a cloud computing platform with 100 machines show that KCminer is both fast and scalable. Besides the MapReduce framework, KCminer executes efficiently on parallel shared memory systems. We performed some experiments on a commodity multicore desktop and showed that KCminer can effectively use the power of all cores. The experimental results show that even using a single thread, KCminer is much faster than available serial tools like MACE.
引用
收藏
页码:964 / 974
页数:11
相关论文
共 50 条
  • [41] Efficient Processing of k Nearest Neighbor Joins using MapReduce
    Lu, Wei
    Shen, Yanyan
    Chen, Su
    Ooi, Beng Chin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (10): : 1016 - 1027
  • [42] Efficient Subgraph Matching on Large RDF Graphs Using MapReduce
    Xin Wang
    Lele Chai
    Qiang Xu
    Yajun Yang
    Jianxin Li
    Junhu Wang
    Yunpeng Chai
    Data Science and Engineering, 2019, 4 : 24 - 43
  • [43] Efficient Subgraph Matching on Large RDF Graphs Using MapReduce
    Wang, Xin
    Chai, Lele
    Xu, Qiang
    Yang, Yajun
    Li, Jianxin
    Wang, Junhu
    Chai, Yunpeng
    DATA SCIENCE AND ENGINEERING, 2019, 4 (01) : 24 - 43
  • [44] Efficient Querying Distributed Big-XML Data using MapReduce
    Song Kunfang
    Hongwei Lu
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (03) : 70 - 79
  • [45] Efficient Large-scale Trace Checking Using MapReduce
    Bersani, Marcello M.
    Bianculli, Domenico
    Ghezzi, Carlo
    Krstic, Srdan
    San Pietro, Pierluigi
    2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 888 - 898
  • [46] Efficient large-scale data analysis using mapreduce
    Kubo, R., 1600, Nippon Telegraph and Telephone Corp. (10):
  • [47] An enhanced and efficient clustering algorithm for large data using MapReduce
    Li, Hongbiao
    Liu, Ruiying
    Wang, Jingdong
    Wu, Qilong
    IAENG International Journal of Computer Science, 2019, 46 (01)
  • [48] Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce (Extended Abstract)
    Zhang, Yanfeng
    Chen, Shimin
    Yu, Ge
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 67 - 68
  • [49] Distributed Top-k Keyword Search over Very Large Databases with MapReduce
    Yu, Ziqiang
    Yu, Xiaohui
    Chen, Yuehui
    Ma, Kun
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 349 - 352
  • [50] An Efficient Distributed Subgraph Mining Algorithm in Extreme Large Graphs
    Wu, Bin
    Bai, YunLong
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2010, 6319 : 107 - 115