Multi-label symbolic value partitioning through random walks

Cited by: 5
Authors
Wen, Liu-Ying [1]
Luo, Chao-Guang [1]
Wu, Wei-Zhi [2, 3]
Min, Fan [1]
Affiliations
[1] Southwest Petr Univ, Sch Comp Sci, Chengdu 610500, Peoples R China
[2] Zhejiang Ocean Univ, Sch Math Phys & Informat Sci, Zhoushan 316022, Peoples R China
[3] Zhejiang Ocean Univ, Key Lab Oceanog Big Data Min & Applicat Zhejiang, Zhoushan 316022, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Clustering; Random walk; Symbolic value partition; Weighted graph; FEATURE-SELECTION; FEATURE-EXTRACTION; CLASSIFICATION; TRANSFORMATION
DOI
10.1016/j.neucom.2020.01.046
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection and symbolic value partitioning are effective knowledge reduction techniques in the field of data mining. A large body of feature selection methods has been proposed for multi-label data. By contrast, symbolic value partitioning for such data has not been studied. In this paper, we propose a two-stage algorithm, multi-label symbolic value partitioning through random walks. In the first stage, an undirected weighted graph is constructed for each attribute. Each node corresponds to an attribute value, and the weight of each edge corresponds to the similarity between its two nodes. Similarity is defined based on the attribute value distribution for each label. In the second stage, a random walk algorithm is used to cluster attribute values. The average weight serves as the separation operator to sharpen the inter-cluster edges. We tested the new algorithm and seven popular feature selection algorithms on 13 datasets. The experimental results demonstrate the effectiveness of the proposed algorithm in reducing the data size and improving classification accuracy. © 2020 Elsevier B.V. All rights reserved.
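The two stages described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the similarity formula or the exact random walk procedure, so several choices here are assumptions. Similarity is taken as 1 minus the mean absolute difference between per-label frequency profiles, edges are re-weighted by how similar short random walks from their two endpoints are, and edges whose sharpened weight falls below the average weight are cut. All function names (label_profiles, similarity_graph, visit_vectors, partition_values) and the steps parameter are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def label_profiles(values, labels):
    """Map each attribute value to the fraction of its instances carrying each label."""
    counts = defaultdict(int)
    sums = {}
    for v, y in zip(values, labels):
        counts[v] += 1
        acc = sums.setdefault(v, [0] * len(y))
        for j, bit in enumerate(y):
            acc[j] += bit
    return {v: [s / counts[v] for s in acc] for v, acc in sums.items()}


def similarity_graph(profiles):
    """Stage 1: complete undirected weighted graph over the attribute values.

    Assumed similarity: 1 - mean absolute difference of the label profiles.
    """
    nodes = sorted(profiles)
    w = {u: {} for u in nodes}
    for u, v in combinations(nodes, 2):
        diff = sum(abs(a - b) for a, b in zip(profiles[u], profiles[v]))
        w[u][v] = w[v][u] = 1.0 - diff / len(profiles[u])
    return nodes, w


def visit_vectors(nodes, w, steps):
    """For each start node, sum the walk's position distributions over steps 1..steps."""
    vectors = {}
    for start in nodes:
        dist = {n: 1.0 if n == start else 0.0 for n in nodes}
        total = {n: 0.0 for n in nodes}
        for _ in range(steps):
            nxt = {n: 0.0 for n in nodes}
            for u in nodes:
                out = sum(w[u].values())
                if dist[u] == 0.0 or out == 0.0:
                    continue
                for v, weight in w[u].items():
                    nxt[v] += dist[u] * weight / out
            dist = nxt
            for n in nodes:
                total[n] += dist[n]
        vectors[start] = total
    return vectors


def partition_values(values, labels, steps=3):
    """Stage 2 (assumed variant): sharpen edges with short walks, cut below-average edges."""
    nodes, w = similarity_graph(label_profiles(values, labels))
    vec = visit_vectors(nodes, w, steps)
    sharp = {}
    for u, v in combinations(nodes, 2):
        l1 = sum(abs(vec[u][n] - vec[v][n]) for n in nodes)
        sharp[(u, v)] = w[u][v] / (1.0 + l1)
    avg = sum(sharp.values()) / len(sharp) if sharp else 0.0

    # Merge the endpoints of every surviving edge (simple union-find).
    parent = {n: n for n in nodes}

    def find(n):
        while parent[n] != n:
            n = parent[n]
        return n

    for (u, v), weight in sharp.items():
        if weight > 0.0 and weight >= avg:
            parent[find(u)] = find(v)

    groups = defaultdict(list)
    for n in nodes:
        groups[find(n)].append(n)
    return sorted(groups.values())


# Toy data: one symbolic attribute, two labels per instance.
values = ["red", "crimson", "blue", "navy", "red", "blue"]
labels = [[1, 0], [1, 0], [0, 1], [0, 1], [1, 1], [0, 1]]
print(partition_values(values, labels))
```

On this toy data the values split into a red/crimson group and a blue/navy group, so a four-valued attribute could be recoded with two symbols. The cut rule and the number of walk steps are design choices of this sketch, and the paper's own separation operator may differ.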
Pages: 195-209
Number of pages: 15