Software component clustering and classification using novel similarity measure

Cited by: 7
Authors
Srinivas, Chintakindi [1]
Radhakrishna, Vangipuram [2]
Rao, C. V. Guru [3]
Affiliations
[1] Kakatiya Inst Technol, Dept Comp Sci & Engn, Warangal, Andhra Pradesh, India
[2] VNR VJIET Autonomous, Dept Informat Technol, Hyderabad, Andhra Pradesh, India
[3] SR Engn Coll Autonomous, Comp Sci & Engn, Warangal, Andhra Pradesh, India
Keywords
software components; similarity; component vector; clustering
DOI
10.1016/j.protcy.2015.02.124
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Similarity measures in the literature, such as Euclidean, Jaccard, Cosine, and Manhattan, consider only the count of features and do not consider the feature distribution or the degree of commonality. Significant research has been carried out on designing new similarity measures that can accurately find the similarity between any two software components. The distribution of component features across software components contributes substantially to evaluating their degree of similarity. This is the key idea behind the design of the proposed measure. The main objective of this research is first to design an efficient similarity measure that considers the distribution of the features over the entire input. We then carry out the analysis for worst-case, average-case, and best-case situations. The proposed measure is Gaussian-based, preserves the properties of the Gaussian function, and can be used for clustering and classification of software components. (C) 2015 The Authors. Published by Elsevier Ltd.
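As a rough illustration of the contrast the abstract draws, the sketch below computes the four classical measures it names on two small component vectors, alongside a generic Gaussian (radial basis) similarity. The abstract does not give the authors' formula, so the Gaussian function here is only a stand-in assumption and not the proposed measure; the function names, the `sigma` parameter, and the sample vectors are all hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Manhattan (city-block) distance between two feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine(a, b):
    """Cosine similarity; defined as 0.0 here if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def jaccard(a, b):
    """Jaccard similarity over the sets of features present (non-zero)."""
    present_a = {i for i, x in enumerate(a) if x}
    present_b = {i for i, y in enumerate(b) if y}
    union = present_a | present_b
    return len(present_a & present_b) / len(union) if union else 1.0


def gaussian_similarity(a, b, sigma=1.0):
    """Generic Gaussian (RBF) similarity: exp(-||a - b||^2 / (2 * sigma^2)).

    Assumption: a standard kernel used for illustration only,
    NOT the measure proposed in the paper.
    """
    squared_distance = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-squared_distance / (2 * sigma ** 2))


if __name__ == "__main__":
    # Hypothetical component vectors: one feature count per position.
    component_a = [1, 0, 2]
    component_b = [0, 0, 4]
    for measure in (euclidean, manhattan, cosine, jaccard, gaussian_similarity):
        print(measure.__name__, measure(component_a, component_b))
```

A Gaussian similarity of this generic form is symmetric, lies in the interval (0, 1], and equals 1 only for identical vectors, which is the kind of property the abstract refers to when it says the measure preserves the properties of the Gaussian function.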
Pages: 866-873
Number of pages: 8