A classification algorithm based on weighted ML-kNN for multi-label data

被引:0
|
作者
Jiang M. [1 ]
Du L. [1 ]
Wu J. [1 ]
Zhang M. [1 ]
Gong Z. [1 ]
机构
[1] Institute of Software and Intelligent Technology, Hangzhou Dianzi University, Hangzhou
来源
Int. J. Internet Manuf. Serv. | 2019年 / 4卷 / 326-342期
关键词
K-nearest neighbour; ML-kNN; Multi-label learning; Weighted multi-label kNN; WML-kNN;
D O I
10.1504/IJIMS.2019.103861
中图分类号
学科分类号
摘要
The ML-kNN algorithm uses naive Bayesian classification to modify the traditional kNN algorithm to solve multi-label classification problems. However, the ML-kNN algorithm is prone to misjudgement or incomplete judgment of the unseen instance's label set in two special cases: when the number of labels in the training set is not balanced and when the training instances are unevenly distributed in space. Therefore, a weighted ML-kNN algorithm (i.e., wML-kNN) is proposed in this paper. The main idea is to assign different weights to each label according to the proportion of labels and mutual information of the spatial distribution of unseen instances to training instances. This method can reduce the probability of misjudgement of the unseen instance's label set. A comparative study was conducted on four multi-label datasets that included review classification and three other published benchmark multi-label datasets: Yeast gene function analysis, natural scene classification, and musical sentiment classification. The results show that the performance of the wML-kNN algorithm is better than the other four multi-label learning algorithms, including ML-kNN. © 2019 Inderscience Enterprises Ltd.
引用
收藏
页码:326 / 342
页数:16
相关论文
共 50 条
  • [21] A Multi-Label Classification Algorithm Based on Label-Specific Features
    QU Huaqiao1
    2.Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education
    WuhanUniversityJournalofNaturalSciences, 2011, 16 (06) : 520 - 524
  • [22] Hierarchical text classification with multi-label contrastive learning and KNN
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    He, Yueshun
    Tan, Hai
    He, Yanxiang
    NEUROCOMPUTING, 2024, 577
  • [23] An Improved ML-kNN Approach Based on Coupled Similarity
    Yang, Xiaodan
    Zhou, Lihua
    Wang, Lizhen
    WEB TECHNOLOGIES AND APPLICATIONS: APWEB 2016 WORKSHOPS, WDMA, GAP, AND SDMA, 2016, 9865 : 77 - 89
  • [24] A Double Weighted Naive Bayes with Niching Cultural Algorithm for Multi-Label Classification
    Yan, Xuesong
    Wu, Qinghua
    Sheng, Victor S.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (06)
  • [25] ML-EC2: An algorithm for multi-label email classification using clustering
    Sharaff A.
    Nagwani N.K.
    International Journal of Web-Based Learning and Teaching Technologies, 2020, 15 (02): : 19 - 33
  • [26] Multi-label classification algorithm research based on swarm intelligence
    Qinghua Wu
    Hanmin Liu
    Xuesong Yan
    Cluster Computing, 2016, 19 : 2075 - 2085
  • [27] Multi-label classification algorithm research based on swarm intelligence
    Wu, Qinghua
    Liu, Hanmin
    Yan, Xuesong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 2075 - 2085
  • [28] OCBMLC: An Overlapping Clustering Based Multi-Label Classification Algorithm
    Peng, Liwen
    Liu, Yongguo
    Liao, Huan
    Zhang, Peng
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 113 - 116
  • [29] Intelligent ZHENG Classification of Hypertension Depending on ML-kNN and Information Fusion
    Li, Guo-Zheng
    Yan, Shi-Xing
    You, Mingyu
    Sun, Sheng
    Ou, Aihua
    EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2012, 2012
  • [30] A multi-label classification algorithm based on random walk model
    Zheng W.
    Wang C.-K.
    Liu Z.
    Wang J.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (08): : 1418 - 1426