A classification algorithm based on weighted ML-kNN for multi-label data

被引:0
|
作者
Jiang M. [1 ]
Du L. [1 ]
Wu J. [1 ]
Zhang M. [1 ]
Gong Z. [1 ]
机构
[1] Institute of Software and Intelligent Technology, Hangzhou Dianzi University, Hangzhou
来源
Int. J. Internet Manuf. Serv. | 2019年 / 4卷 / 326-342期
关键词
K-nearest neighbour; ML-kNN; Multi-label learning; Weighted multi-label kNN; WML-kNN;
D O I
10.1504/IJIMS.2019.103861
中图分类号
学科分类号
摘要
The ML-kNN algorithm uses naive Bayesian classification to modify the traditional kNN algorithm to solve multi-label classification problems. However, the ML-kNN algorithm is prone to misjudgement or incomplete judgment of the unseen instance's label set in two special cases: when the number of labels in the training set is not balanced and when the training instances are unevenly distributed in space. Therefore, a weighted ML-kNN algorithm (i.e., wML-kNN) is proposed in this paper. The main idea is to assign different weights to each label according to the proportion of labels and mutual information of the spatial distribution of unseen instances to training instances. This method can reduce the probability of misjudgement of the unseen instance's label set. A comparative study was conducted on four multi-label datasets that included review classification and three other published benchmark multi-label datasets: Yeast gene function analysis, natural scene classification, and musical sentiment classification. The results show that the performance of the wML-kNN algorithm is better than the other four multi-label learning algorithms, including ML-kNN. © 2019 Inderscience Enterprises Ltd.
引用
收藏
页码:326 / 342
页数:16
相关论文
共 50 条
  • [31] EvoImp: Multiple Imputation of Multi-label Classification data with a genetic algorithm
    Jacob Junior, Antonio Fernando Lavareda
    do Carmo, Fabricio Almeida
    de Santana, Adamo Lima
    Santana, Ewaldo Eder Carvalho
    Lobato, Fabio Manoel Franca
    PLOS ONE, 2024, 19 (01):
  • [32] A Double Weighted Naive Bayes for Multi-label Classification
    Yan, Xuesong
    Li, Wei
    Wu, Qinghua
    Sheng, Victor S.
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015), 2016, 575 : 382 - 389
  • [33] An improved multi-label classification algorithm BRkNN
    Geng, Xia
    Tang, Yujia
    Zhu, Yuquan
    Cheng, Geng
    Journal of Information and Computational Science, 2014, 11 (16): : 5927 - 5936
  • [34] A Joint Embedded Multi-label Classification Algorithm
    Liu H.-T.
    Leng X.-Y.
    Wang L.-L.
    Zhao P.
    Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (10): : 1969 - 1982
  • [35] A Supervised Laplacian Eigenmap Algorithm for Visualization of Multi-label Data: SLE-ML
    Tai, Mariko
    Kudo, Mineichi
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 525 - 534
  • [36] Multi-Label Classification Based on Associations
    Alazaidah, Raed
    Samara, Ghassan
    Almatarneh, Sattam
    Hassan, Mohammad
    Aljaidi, Mohammad
    Mansur, Hasan
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [37] Correlation-Based Weighted K-Labelsets for Multi-label Classification
    Xu, Jingyang
    Ma, Jun
    WEB TECHNOLOGIES AND APPLICATIONS, PT I, 2016, 9931 : 408 - 419
  • [38] A Survey on Multi-Label Data Stream Classification
    Zheng, Xiulin
    Li, Peipei
    Chu, Zhe
    Hu, Xuegang
    IEEE ACCESS, 2020, 8 : 1249 - 1275
  • [39] A New Kernel-Based Classification Algorithm for Multi-label Datasets
    Lahouari Ghouti
    Arabian Journal for Science and Engineering, 2016, 41 : 759 - 771
  • [40] A Novel Learning-Based PLST Algorithm for Multi-Label Classification
    Ebrahimi, Seyed Hossein Seyed
    Majidzadeh, Kambiz
    Gharehchopogh, Farhad Soleimanian
    IETE JOURNAL OF RESEARCH, 2024, 70 (05) : 4702 - 4720