Efficient Distributed k-Clique Mining for Large Networks Using MapReduce

被引：0

作者：

Shahrivari, Saeed ^{[1
]}

Jalili, Saeed ^{[1
]}

机构：

[1] Tarbiat Modares Univ, Tehran 14115111, Iran

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2021年 / 33卷 / 03期

关键词：

Cloud computing; Data mining; Memory management; Task analysis; Social networking (online); Bioinformatics; Multicore processing; k-clique mining; MapReduce algorithms; parallel graph; COMMUNITY STRUCTURE;

D O I：

10.1109/TKDE.2019.2936027

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Mining cliques of a network is an important problem that has many applications in different fields like social networks, bioinformatics, and web analysis. In most applications, mining fixed sized cliques, known as k-cliques, is enough. However, mining cliques of a large network is very challenging using current solutions, and it takes a considerable time using a commodity machine. Also, very large networks cannot be efficiently loaded into memory of a single machine. To overcome these limitations, we have proposed a solution named KCminer, which is based on state space search and can be totally fitted into the MapReduce framework. Using the MapReduce framework, it is possible to run KCminer on cloud computing platforms and hence, process very large networks in feasible time. Our experiments which were performed on a cloud computing platform with 100 machines show that KCminer is both fast and scalable. Besides the MapReduce framework, KCminer executes efficiently on parallel shared memory systems. We performed some experiments on a commodity multicore desktop and showed that KCminer can effectively use the power of all cores. The experimental results show that even using a single thread, KCminer is much faster than available serial tools like MACE.

引用

页码：964 / 974

页数：11

共 50 条

[41] Efficient Processing of k Nearest Neighbor Joins using MapReduce
Lu, Wei
Shen, Yanyan
Chen, Su
Ooi, Beng Chin
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (10): : 1016 - 1027
[42] Efficient Subgraph Matching on Large RDF Graphs Using MapReduce
Xin Wang
Lele Chai
Qiang Xu
Yajun Yang
Jianxin Li
Junhu Wang
Yunpeng Chai
Data Science and Engineering, 2019, 4 : 24 - 43
[43] Efficient Subgraph Matching on Large RDF Graphs Using MapReduce
Wang, Xin
Chai, Lele
Xu, Qiang
Yang, Yajun
Li, Jianxin
Wang, Junhu
Chai, Yunpeng
DATA SCIENCE AND ENGINEERING, 2019, 4 (01) : 24 - 43
[44] Efficient Querying Distributed Big-XML Data using MapReduce
Song Kunfang
Hongwei Lu
INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (03) : 70 - 79
[45] Efficient Large-scale Trace Checking Using MapReduce
Bersani, Marcello M.
Bianculli, Domenico
Ghezzi, Carlo
Krstic, Srdan
San Pietro, Pierluigi
2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 888 - 898
[46] Efficient large-scale data analysis using mapreduce
Kubo, R., 1600, Nippon Telegraph and Telephone Corp. (10):
[47] An enhanced and efficient clustering algorithm for large data using MapReduce
Li, Hongbiao
Liu, Ruiying
Wang, Jingdong
Wu, Qilong
IAENG International Journal of Computer Science, 2019, 46 (01)
[48] Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce (Extended Abstract)
Zhang, Yanfeng
Chen, Shimin
Yu, Ge
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 67 - 68
[49] Distributed Top-k Keyword Search over Very Large Databases with MapReduce
Yu, Ziqiang
Yu, Xiaohui
Chen, Yuehui
Ma, Kun
2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 349 - 352
[50] An Efficient Distributed Subgraph Mining Algorithm in Extreme Large Graphs
Wu, Bin
Bai, YunLong
ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2010, 6319 : 107 - 115

← 1 2 3 4 5 →