A classification algorithm based on weighted ML-kNN for multi-label data

被引:0
|
作者
Jiang M. [1 ]
Du L. [1 ]
Wu J. [1 ]
Zhang M. [1 ]
Gong Z. [1 ]
机构
[1] Institute of Software and Intelligent Technology, Hangzhou Dianzi University, Hangzhou
来源
Int. J. Internet Manuf. Serv. | 2019年 / 4卷 / 326-342期
关键词
K-nearest neighbour; ML-kNN; Multi-label learning; Weighted multi-label kNN; WML-kNN;
D O I
10.1504/IJIMS.2019.103861
中图分类号
学科分类号
摘要
The ML-kNN algorithm uses naive Bayesian classification to modify the traditional kNN algorithm to solve multi-label classification problems. However, the ML-kNN algorithm is prone to misjudgement or incomplete judgment of the unseen instance's label set in two special cases: when the number of labels in the training set is not balanced and when the training instances are unevenly distributed in space. Therefore, a weighted ML-kNN algorithm (i.e., wML-kNN) is proposed in this paper. The main idea is to assign different weights to each label according to the proportion of labels and mutual information of the spatial distribution of unseen instances to training instances. This method can reduce the probability of misjudgement of the unseen instance's label set. A comparative study was conducted on four multi-label datasets that included review classification and three other published benchmark multi-label datasets: Yeast gene function analysis, natural scene classification, and musical sentiment classification. The results show that the performance of the wML-kNN algorithm is better than the other four multi-label learning algorithms, including ML-kNN. © 2019 Inderscience Enterprises Ltd.
引用
收藏
页码:326 / 342
页数:16
相关论文
共 50 条
  • [1] Multi-label Classification of Twitter Data Using Modified ML-KNN
    Srivastava, Saurabh Kumar
    Singh, Sandeep Kumar
    ADVANCES IN DATA AND INFORMATION SCIENCES, ICDIS 2017, VOL 2, 2019, 39 : 31 - 41
  • [2] Research on multi-label user classification of social media based on ML-KNN algorithm
    Huang, Anzhong
    Xu, Rui
    Chen, Yu
    Guo, Meiwen
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2023, 188
  • [3] An Adaptation of the ML-kNN Algorithm to Predict the Number of Classes in Hierarchical Multi-label Classification
    Almeida, Thissiany Beatriz
    Borges, Helyane Bronoski
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2017), 2017, 10571 : 77 - 88
  • [4] An Improved ML-kNN Multi-label Classification Model Based on Feature Dimensionality Reduction
    Li, Zhi-qiang
    Cao, Shuai-yi
    Guo, Hong-chen
    INTERNATIONAL CONFERENCE ON COMPUTER, MECHATRONICS AND ELECTRONIC ENGINEERING (CMEE 2016), 2016,
  • [5] ML-KNN: A lazy learning approach to multi-label leaming
    Zhang, Min-Ling
    Zhou, Zhi-Hua
    PATTERN RECOGNITION, 2007, 40 (07) : 2038 - 2048
  • [6] A multi-label social short text classification method based on contrastive learning and improved ml-KNN
    Tian, Gang
    Wang, Jiachang
    Wang, Rui
    Zhao, Guangxin
    He, Cheng
    EXPERT SYSTEMS, 2024, 41 (07)
  • [7] Multi-Label Code Error Classification Using CodeT5 and ML-KNN
    Amin, Md Faizul Ibne
    Shirafuji, Atsushi
    Rahman, Md Mostafizer
    Watanobe, Yutaka
    IEEE ACCESS, 2024, 12 : 100805 - 100820
  • [8] Ensemble of ML-KNN for classification algorithm recommendation
    Zhu, Xiaoyan
    Ying, Chenzhen
    Wang, Jiayin
    Li, Jiaxuan
    Lai, Xin
    Wang, Guangtao
    KNOWLEDGE-BASED SYSTEMS, 2021, 221
  • [9] An Improved ML-kNN Algorithm by Fusing Nearest Neighbor Classification
    Zeng, Yong
    Fu, Hao-ming
    Zhang, Yu-ping
    Zhao, Xi-ya
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER SCIENCE (AICS 2016), 2016, : 193 - 198
  • [10] A Weighted Ensemble Classification Algorithm Based on Nearest Neighbors for Multi-Label Data Stream
    Wu, Hongxin
    Han, Meng
    Chen, Zhiqiang
    Li, Muhang
    Zhang, Xilong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (05)