Selection and Aggregation Techniques for Crowdsourced Semantic Annotation Task

Cited by: 0
Authors
Chowdhury, Shammur Absar [1]
Calvo, Marcos [2]
Ghosh, Arindam [1]
Stepanov, Evgeny A. [1]
Bayer, Ali Orkan [1]
Riccardi, Giuseppe [1]
Garcia, Fernando [2]
Sanchis, Emilio [2]
Affiliations
[1] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[2] Univ Politecn Valencia, Dept Sistemes Informat & Computacio, Valencia, Spain
Keywords
Crowdsourcing; Annotation; Cross-language porting;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Crowdsourcing is an accessible and cost-effective alternative to traditional methods of collecting and annotating data. The application of crowdsourcing to simple tasks has been well investigated. However, complex tasks like semantic annotation transfer require workers to make simultaneous decisions on chunk segmentation and labeling while acquiring domain-specific knowledge on the go. The increased task complexity may generate low judgment agreement and/or poor performance. The goal of this paper is to cope with these crowdsourcing requirements with semantic priming and unsupervised quality control mechanisms. We aim at automatic quality control that takes into account different levels of workers' expertise and annotation task performance. We investigate judgment selection and aggregation techniques on the task of cross-language semantic annotation transfer. We propose stochastic modeling techniques to estimate the task performance of a worker on a particular judgment with respect to the whole worker group. These estimates are used for the selection of the best judgments as well as for weighted consensus-based annotation aggregation. We demonstrate that the technique is useful for increasing the quality of collected annotations.
Pages: 2779 - 2783
Page count: 5
Related papers
50 in total
  • [41] Cross-language transfer of semantic annotation via targeted crowdsourcing: task design and evaluation
    Evgeny A. Stepanov
    Shammur Absar Chowdhury
    Ali Orkan Bayer
    Arindam Ghosh
    Ioannis Klasinas
    Marcos Calvo
    Emilio Sanchis
    Giuseppe Riccardi
    Language Resources and Evaluation, 2018, 52 : 341 - 364
  • [42] Cross-language transfer of semantic annotation via targeted crowdsourcing: task design and evaluation
    Stepanov, Evgeny A.
    Chowdhury, Shammur Absar
    Bayer, Ali Orkan
    Ghosh, Arindam
    Klasinas, Ioannis
    Calvo, Marcos
    Sanchis, Emilio
    Riccardi, Giuseppe
    LANGUAGE RESOURCES AND EVALUATION, 2018, 52 (01) : 341 - 364
  • [43] Adapting content-based image retrieval techniques for the semantic annotation of medical images
    Kumar, Ashnil
    Dyer, Shane
    Kim, Jinman
    Li, Changyang
    Leong, Philip H. W.
    Fulham, Michael
    Feng, Dagan
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2016, 49 : 37 - 45
  • [44] A Review of Semantic Annotation Models for Analysis of Healthcare Data Based on Data Mining Techniques
    Manonmani, M.
    Balakrishnan, Sarojini
    EMERGING RESEARCH IN DATA ENGINEERING SYSTEMS AND COMPUTER COMMUNICATIONS, CCODE 2019, 2020, 1054 : 231 - 238
  • [45] Mining confident supervision by prototypes discovering and annotation selection for weakly supervised semantic segmentation
    Zhou, Lei
    Chen, Huagui
    Wei, Yufeng
    Li, Xiaoxiao
    NEUROCOMPUTING, 2022, 501 : 420 - 435
  • [46] An Annotation Workbench for Semantic Annotation of Data Collection Instruments
    Sasse, Julia
    Fluck, Juliane
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 108 - 112
  • [47] A Negative Sample Image Selection Method Referring to Semantic Hierarchical Structure for Image Annotation
    Chan, Shan-Bin
    Yamana, Hayato
    Satoh, Shin'ichi
    2013 INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2013, : 162 - 167
  • [48] CTRAS: Crowdsourced Test Report Aggregation and Summarization
    Hao, Rui
    Feng, Yang
    Jones, James A.
    Li, Yuying
    Chen, Zhenyu
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 900 - 911
  • [49] Parting Crowds: Characterizing Divergent Interpretations in Crowdsourced Annotation Tasks
    Kairam, Sanjay
    Heer, Jeffrey
    ACM CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING (CSCW 2016), 2016, : 1637 - 1648
  • [50] SEMANTIC LOCATION EXTRACTION FROM CROWDSOURCED DATA
    Koswatte, S.
    Mcdougall, K.
    Liu, X.
    XXIII ISPRS CONGRESS, COMMISSION II, 2016, 41 (B2): : 543 - 547