SOFTWARE ARCHITECTURE RECOVERY THROUGH SIMILARITY-BASED GRAPH CLUSTERING

被引:5
|
作者
Zhu, Jianlin [1 ]
Huang, Jin [2 ]
Zhou, Daicui [1 ]
Yin, Zhongbao [1 ]
Zhang, Guoping [3 ]
He, Qiang [4 ]
机构
[1] Cent China Normal Univ, Minist Educ, Key Lab Quark & Lepton Phys, Wuhan 430079, Peoples R China
[2] China Shipbldg Ind Corp, Res Inst 709, Wuhan 430070, Peoples R China
[3] Cent China Normal Univ, Coll Phys Sci & Technol, Wuhan 430079, Peoples R China
[4] Swinburne Univ Technol, Fac Informat & Commun Technol, Melbourne, Vic 3122, Australia
关键词
Software architecture recovery; hierarchy graph clustering; similarity-based clustering; multiple stable layers; ALGORITHMS;
D O I
10.1142/S0218194013500162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software architecture recovery is to gain the architectural level understanding of a software system while its architecture description does not exist. In recent years, researchers have adopted various software clustering techniques to detect hierarchical structure of software systems. Most graph clustering techniques focus on the connectivity between program elements, but unreasonably ignore the similarity which is also a key measure for finding elements of one module. In this paper we propose a novel hierarchy graph clustering algorithm DGHC, which considers both similarity and connectivity between program elements. During the transformation of program dependence graph edges representing similarity between elements are added. Then similar elements are grouped by density-based approaches. The alternative strategy is adopted to find groups of closely connected and similar elements. Meanwhile we adjust the contribution of connectivity and similarity by a flexible clustering algorithm based on short random walk model, which can obtain more structure information of software to find its multiple layers. Furthermore a new method called Multi-layer Propagation Gap is proposed to suggest stable layers of hierarchy clustering result as multiple layers of software system. Extensive experimental results illustrate the effectiveness and efficiency of DGHC in detecting hierarchy structure of software through comparison with various software clustering methods.
引用
收藏
页码:559 / 586
页数:28
相关论文
共 50 条
  • [21] A novel similarity-based modularity function for graph partitioning
    Feng, Zhidan
    Xu, Xiaowei
    Yuruk, Nurcan
    Schweiger, Thomas A. J.
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2007, 4654 : 385 - 396
  • [22] Similarity-based soft clustering algorithm for web documents
    School of Remote Sensing Information Engineering, Wuhan University, Wuhan 430079, China
    Jisuanji Gongcheng, 2006, 2 (59-61):
  • [23] Subspace Similarity-based Algorithm for Combine Multiple Clustering
    Xu, Sen
    Li, Xianfeng
    Chen, Rong
    Wu, Shuang
    Ni, Jun
    2013 SEVENTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR ENGINEERING AND SCIENCE (ICICSE 2013), 2013, : 69 - 76
  • [24] Data integration by fuzzy similarity-based hierarchical clustering
    Ciaramella, Angelo
    Nardone, Davide
    Staiano, Antonino
    BMC BIOINFORMATICS, 2020, 21 (Suppl 10)
  • [25] A Novel Similarity-based Modularity Function for Graph Partitioning
    Feng, Zhidan
    Xu, Xiaowei
    Yuruk, Nurcan
    Schweiger, Thomas
    CLUSTER CHALLENGES IN BIOLOGICAL NETWORKS, 2009, : 223 - +
  • [26] Similarity-based knowledge graph queries for recommendation retrieval
    Wenige, Lisa
    Ruhland, Johannes
    SEMANTIC WEB, 2019, 10 (06) : 1007 - 1037
  • [27] Data integration by fuzzy similarity-based hierarchical clustering
    Angelo Ciaramella
    Davide Nardone
    Antonino Staiano
    BMC Bioinformatics, 21
  • [28] Spectral analysis of text collection for similarity-based clustering
    Li, WY
    Ng, WK
    Lim, EP
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 389 - 393
  • [29] Predicting user preferences via similarity-based clustering
    Qin, Mian
    Buffett, Scott
    Fleming, Michael W.
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2008, 5032 : 222 - +
  • [30] Spectral analysis of text collection for similarity-based clustering
    Li, WY
    Ng, WK
    Lim, EP
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 833 - 833