SOFTWARE ARCHITECTURE RECOVERY THROUGH SIMILARITY-BASED GRAPH CLUSTERING

被引:5
|
作者
Zhu, Jianlin [1 ]
Huang, Jin [2 ]
Zhou, Daicui [1 ]
Yin, Zhongbao [1 ]
Zhang, Guoping [3 ]
He, Qiang [4 ]
机构
[1] Cent China Normal Univ, Minist Educ, Key Lab Quark & Lepton Phys, Wuhan 430079, Peoples R China
[2] China Shipbldg Ind Corp, Res Inst 709, Wuhan 430070, Peoples R China
[3] Cent China Normal Univ, Coll Phys Sci & Technol, Wuhan 430079, Peoples R China
[4] Swinburne Univ Technol, Fac Informat & Commun Technol, Melbourne, Vic 3122, Australia
关键词
Software architecture recovery; hierarchy graph clustering; similarity-based clustering; multiple stable layers; ALGORITHMS;
D O I
10.1142/S0218194013500162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software architecture recovery is to gain the architectural level understanding of a software system while its architecture description does not exist. In recent years, researchers have adopted various software clustering techniques to detect hierarchical structure of software systems. Most graph clustering techniques focus on the connectivity between program elements, but unreasonably ignore the similarity which is also a key measure for finding elements of one module. In this paper we propose a novel hierarchy graph clustering algorithm DGHC, which considers both similarity and connectivity between program elements. During the transformation of program dependence graph edges representing similarity between elements are added. Then similar elements are grouped by density-based approaches. The alternative strategy is adopted to find groups of closely connected and similar elements. Meanwhile we adjust the contribution of connectivity and similarity by a flexible clustering algorithm based on short random walk model, which can obtain more structure information of software to find its multiple layers. Furthermore a new method called Multi-layer Propagation Gap is proposed to suggest stable layers of hierarchy clustering result as multiple layers of software system. Extensive experimental results illustrate the effectiveness and efficiency of DGHC in detecting hierarchy structure of software through comparison with various software clustering methods.
引用
收藏
页码:559 / 586
页数:28
相关论文
共 50 条
  • [41] Entity Similarity-Based Negative Sampling for Knowledge Graph Embedding
    Yao, Naimeng
    Liu, Qing
    Li, Xiang
    Yang, Yi
    Bai, Quan
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 73 - 87
  • [42] Graph semantic similarity-based automatic assessment for programming exercises
    Xiang, Chengguan
    Wang, Ying
    Zhou, Qiyun
    Yu, Zhen
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [43] A Fuzzy Similarity-based Clustering Optimized by Particle Swarm Optimization
    Chen Donghui
    Liu Zhijing
    Wang Zonghu
    CHINESE JOURNAL OF ELECTRONICS, 2013, 22 (03): : 461 - 465
  • [44] An efficient similarity-based validity index for kernel clustering algorithm
    Pu, Yun-Wei
    Zhu, Ming
    Jin, Wei-Dong
    Hu, Lai-Zhao
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1044 - 1049
  • [45] Similarity-based clustering of multifeature objects in visual working memory
    Gaeun Son
    Sang Chul Chong
    Attention, Perception, & Psychophysics, 2023, 85 : 2242 - 2256
  • [46] Topological Similarity-Based Feature Selection for Graph Classiinot signcation
    Han, Yongkoo
    Park, Kisung
    Guan, Donghai
    Halder, Sajal
    Lee, Young-Koo
    COMPUTER JOURNAL, 2015, 58 (09): : 1884 - 1893
  • [47] Similarity-based clustering of multifeature objects in visual working memory
    Son, Gaeun
    Chong, Sang Chul
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2023, 85 (07) : 2242 - 2256
  • [49] Performance studies of some similarity-based fuzzy Clustering algorithms
    School of Information Technology, Indian Institute of Technology, Kharagpur-721302, West Bengal, India
    不详
    Int. J. Perform. Eng., 2006, 2 (191-200):
  • [50] Similarity-based Clustering by Left-Stochastic Matrix Factorization
    Arora, Raman
    Gupta, Maya R.
    Kapila, Amol
    Fazel, Maryam
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 1715 - 1746