Self-paced annotations of crowd workers

被引:0
|
作者
Xiangping Kang
Guoxian Yu
Carlotta Domeniconi
Jun Wang
Wei Guo
Yazhou Ren
Xiayan Zhang
Lizhen Cui
机构
[1] Shandong University,School of Software
[2] Shandong University,Joint SDU
[3] George Mason University,NTU Centre for Artificial Intelligence Research (C
[4] University of Electron Science and Technology of China,FAIR)
[5] Shenzhen Polytechnic,Department of Computer Science
来源
关键词
Crowdsourcing; Self-paced learning; Quality control; Task assignment; Truth inference;
D O I
暂无
中图分类号
学科分类号
摘要
Crowdsourcing can harness human intelligence to handle computer-hard tasks in a relatively economic way. The collected answers from various crowd workers are of different qualities, due to the task difficulty, worker capability, incentives and other factors. To maintain high-quality answers while reducing the cost, various strategies have been developed by modeling tasks, workers, or both. Nevertheless, they typically deem that the capability of workers is static when assigning/completing all the tasks. However, in actual fact, crowd workers can improve their capability by gradually completing easy to hard tasks, alike human beings’ intrinsic self-paced learning ability. In this paper, we study crowdsourcing with self-paced workers, whose capability can be progressively improved as they scrutinize and complete tasks from to easy to hard. We introduce a Self-paced Crowd-worker model (SPCrowder). In SPCrowder, workers firstly do a set of golden tasks with known truths, which serve as feedbacks to assist workers capturing the raw modes of tasks and to stimulate the self-paced learning. This also helps to estimate workers’ quality and tasks’ difficulty. SPCrowder then uses a task difficulty model to dynamically measure the difficulty of tasks and rank them from easy to hard and assign tasks to self-paced workers by maximizing a benefit criterion. By doing so, a normal worker can be capable to handle hard tasks after completing some easier and related tasks. We conducted extensive experiments on semi-simulated and real crowdsourcing datasets, SPCrowder outperforms competitive methods in quality control and budget saving. Crowd workers indeed hold the self-paced learning ability, which boosts the quality and save the budget.
引用
收藏
页码:3235 / 3263
页数:28
相关论文
共 50 条
  • [1] Self-paced annotations of crowd workers
    Kang, Xiangping
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Guo, Wei
    Ren, Yazhou
    Zhang, Xiayan
    Cui, Lizhen
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (12) : 3235 - 3263
  • [2] Crowdsourcing with Self-paced Workers
    Kang, Xiangping
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Guo, Wei
    Ren, Yazhou
    Cui, Lizhen
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 280 - 289
  • [3] Self-Paced AutoEncoder
    Yu, Tingzhao
    Guo, Chaoxu
    Wang, Lingfeng
    Xiang, Shiming
    Pan, Chunhong
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (07) : 1054 - 1058
  • [4] SELF-PACED CHEMISTRY
    HAWKES, SJ
    JOURNAL OF CHEMICAL EDUCATION, 1984, 61 (06) : 564 - 565
  • [5] SELF-PACED TRAINING
    FRIEDMAN, R
    BYTE, 1995, 20 (06): : 45 - 45
  • [6] A DEVICE FOR PACED AND SELF-PACED VISUAL DISPLAYS
    ARENBERG, D
    SURWILLO, WW
    AMERICAN JOURNAL OF PSYCHOLOGY, 1966, 79 (01): : 120 - &
  • [7] SELF-PACED SUBSPACE CLUSTERING
    Liu, Youfa
    Du, Bo
    Zhang, Lefei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 350 - 355
  • [8] Self-Paced Network Embedding
    Gao, Hongchang
    Huang, Heng
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1406 - 1415
  • [9] HYPOXIA AND SELF-PACED WORK
    GERBEN, MJ
    HOUSE, JL
    PERCEPTUAL AND MOTOR SKILLS, 1969, 28 (03) : 995 - &
  • [10] SELF-PACED INSTRUCTION - REVIEW
    ALLISON, E
    JOURNAL OF ECONOMIC EDUCATION, 1975, 7 (01): : 5 - 12