Label distribution similarity-based noise correction for crowdsourcing

被引:4
|
作者
Ren, Lijuan [1 ]
Jiang, Liangxiao [1 ]
Zhang, Wenjun [1 ]
Li, Chaoqun [2 ,3 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] Minist Educ, Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
crowdsourcing learning; noise correction; label distribution similarity; kullback-leibler divergence; QUALITY; TOOL;
D O I
10.1007/s11704-023-2751-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In crowdsourcing scenarios, we can obtain each instance's multiple noisy labels from different crowd workers and then infer its integrated label via label aggregation. In spite of the effectiveness of label aggregation methods, there still remains a certain level of noise in the integrated labels. Thus, some noise correction methods have been proposed to reduce the impact of noise in recent years. However, to the best of our knowledge, existing methods rarely consider an instance's information from both its features and multiple noisy labels simultaneously when identifying a noise instance. In this study, we argue that the more distinguishable an instance's features but the noisier its multiple noisy labels, the more likely it is a noise instance. Based on this premise, we propose a label distribution similarity-based noise correction (LDSNC) method. To measure whether an instance's features are distinguishable, we obtain each instance's predicted label distribution by building multiple classifiers using instances' features and their integrated labels. To measure whether an instance's multiple noisy labels are noisy, we obtain each instance's multiple noisy label distribution using its multiple noisy labels. Then, we use the Kullback-Leibler (KL) divergence to calculate the similarity between the predicted label distribution and multiple noisy label distribution and define the instance with the lower similarity as a noise instance. The extensive experimental results on 34 simulated and four real-world crowdsourced datasets validate the effectiveness of our method.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] On Similarity-Based Unfolding
    Moreno, Gines
    Penabad, Jaime
    Antonio Riaza, Jose
    SCALABLE UNCERTAINTY MANAGEMENT (SUM 2017), 2017, 10564 : 420 - 426
  • [22] Similarity-based Fisherfaces
    Delgado-Gomez, David
    Fagertun, Jens
    Ersboll, Bjarne
    Sukno, Federico M.
    Frangi, Alejandro F.
    PATTERN RECOGNITION LETTERS, 2009, 30 (12) : 1110 - 1116
  • [23] Similarity-based large-scale distribution mapping of orchids
    Remm, Kalle
    Remm, Liina
    BIODIVERSITY AND CONSERVATION, 2009, 18 (06) : 1629 - 1647
  • [24] Similarity-based document distribution for efficient distributed information retrieval
    Herschel, Sven
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007, PROCEEDINGS, 2007, 4831 : 99 - 110
  • [25] Similarity-based large-scale distribution mapping of orchids
    Kalle Remm
    Liina Remm
    Biodiversity and Conservation, 2009, 18 : 1629 - 1647
  • [26] Noise Suppression With Similarity-Based Self-Supervised Deep Learning
    Niu, Chuang
    Li, Mengzhou
    Fan, Fenglei
    Wu, Weiwen
    Guo, Xiaodong
    Lyu, Qing
    Wang, Ge
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (06) : 1590 - 1602
  • [27] Multiplicative Noise Removal via Nonlocal Similarity-Based Sparse Representation
    Chen, Lixia
    Liu, Xujiao
    Wang, Xuewen
    Zhu, Pingfang
    JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2016, 54 (02) : 199 - 215
  • [28] Multiplicative Noise Removal via Nonlocal Similarity-Based Sparse Representation
    Lixia Chen
    Xujiao Liu
    Xuewen Wang
    Pingfang Zhu
    Journal of Mathematical Imaging and Vision, 2016, 54 : 199 - 215
  • [29] Similarity-Based Label Inference Attack Against Training and Inference of Split Learning
    Liu, Junlin
    Lyu, Xinchen
    Cui, Qimei
    Tao, Xiaofeng
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 2881 - 2895
  • [30] Gaussian similarity-based adaptive dynamic label assignment for tiny object detection
    Fu, Ronghao
    Chen, Chengcheng
    Yan, Shuang
    Heidari, Ali Asghar
    Wang, Xianchang
    Escorcia-Gutierrez, Jose
    Mansour, Romany F.
    Chene, Huiling
    NEUROCOMPUTING, 2023, 543