Graph clustering-based discretization of splitting and merging methods (GraphS and GraphM)

被引:20
|
作者
Sriwanna, Kittakorn [1 ]
Boongoen, Tossapon [1 ]
Iam-On, Natthakan [1 ]
机构
[1] Mae Fah Luang Univ, Sch Informat Technol, Phahon Yothin Rd, Muang 57100, Chiang Rai, Thailand
关键词
Multivariate discretization; Graph clustering; Normalized cuts; Normalized association; Data mining; ALGORITHM; TESTS;
D O I
10.1186/s13673-017-0103-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discretization plays a major role as a data preprocessing technique used in machine learning and data mining. Recent studies have focused on multivariate discretization that considers relations among attributes. The general goal of this method is to obtain the discrete data, which preserves most of the semantics exhibited by original continuous data. However, many techniques generate the final discrete data that may be less useful with natural groups of data not being maintained. This paper presents a novel graph clustering-based discretization algorithm that encodes different similarity measures into a graph representation of the examined data. The intuition allows more refined data-wise relations to be obtained and used with the effective graph clustering technique based on normalized association to discover nature graphs accurately. The goodness of this approach is empirically demonstrated over 30 standard datasets and 20 imbalanced datasets, compared with 11 well-known discretization algorithms using 4 classifiers. The results suggest the new approach is able to preserve the natural groups and usually achieve the efficiency in terms of classifier performance, and the desired number of intervals than the comparative methods.
引用
收藏
页数:39
相关论文
共 50 条
  • [41] Clustering-based force-directed algorithms for 3D graph visualization
    Jiawei Lu
    Yain-Whar Si
    The Journal of Supercomputing, 2020, 76 : 9654 - 9715
  • [42] A novel clustering-based anonymization approach for graph to achieve Privacy Preservation in Social Network
    Jiang, Huowen
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS, 2015, 15 : 545 - 549
  • [43] Unsupervised Image Segmentation based Graph Clustering Methods
    Gammoudil, Islem
    Mahjoub, Mohamed Ali
    Guerdelli, Fethi
    COMPUTACION Y SISTEMAS, 2020, 24 (03): : 969 - 987
  • [44] Structural graph clustering on signed graphs: An index-based approach
    Zhao, Zheng
    Li, Wei
    Wang, Xiao
    Meng, Xiangxu
    Zheng, Xiangping
    Wang, Chenhao
    INFORMATION SCIENCES, 2025, 699
  • [45] Scheduling of large signal flow graphs based on metric graph clustering
    Depuydt, F.
    Goossens, G.
    van Meerbergen, J.
    Catthoor, F.
    De Man, H.
    Proceedings of the IFIP TC/WG10.5 Workshop on Logic and Architecture Synthesis, 1991,
  • [46] PSO with surrogate models for feature selection: static and dynamic clustering-based methods
    Hoai Bach Nguyen
    Xue, Bing
    Andreae, Peter
    MEMETIC COMPUTING, 2018, 10 (03) : 291 - 300
  • [47] PSO with surrogate models for feature selection: static and dynamic clustering-based methods
    Hoai Bach Nguyen
    Bing Xue
    Peter Andreae
    Memetic Computing, 2018, 10 : 291 - 300
  • [48] Efficient large scale global optimization through clustering-based population methods
    Schoen, Fabio
    Tigli, Luca
    COMPUTERS & OPERATIONS RESEARCH, 2021, 127
  • [49] Detection of Random Body Movements Using Clustering-Based Methods in Bioradar Systems
    Rouco, Andre
    Silva, Filipe
    Soares, Beatriz
    Albuquerque, Daniel
    Gouveia, Carolina
    Bras, Susana
    Pinho, Pedro
    INFORMATION, 2024, 15 (10)
  • [50] Clustering-based knowledge graphs and entity-relation representation improves the detection of at risk students
    Albreiki, Balqis
    Habuza, Tetiana
    Palakkal, Nishi
    Zaki, Nazar
    EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (06) : 6791 - 6820