Quality Estimation for Image Captions Based on Large-scale Human Evaluations

被引:0
|
作者
Levinboim, Tomer [1 ]
Thapliyal, Ashish V. [1 ]
Sharma, Piyush [1 ]
Soricut, Radu [1 ]
机构
[1] Google Res, Venice, CA 90291 USA
关键词
LANGUAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic image captioning has improved significantly over the last few years, but the problem is far from being solved, with state of the art models still often producing low quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model the caption quality from a human perspective and without access to ground-truth references, so that it can be applied at prediction time to detect low-quality captions produced on previously unseen images. For this task, we develop a human evaluation process that collects coarse-grained caption annotations from crowdsourced users, which is then used to collect a large scale dataset spanning more than 600k caption quality ratings. We then carefully validate the quality of the collected ratings and establish baseline models for this new QE task. Finally, we further collect fine-grained caption quality annotations from trained raters, and use them to demonstrate that QE models trained over the coarse ratings can effectively detect and filter out low-quality image captions, thereby improving the user experience from captioning systems.
引用
收藏
页码:3157 / 3166
页数:10
相关论文
共 50 条
  • [21] When should we conduct large-scale evaluations?
    Holmes, John
    ADDICTION, 2023, 118 (09) : 1622 - 1623
  • [22] Beyond accountability: learning from large-scale evaluations
    Boerma, Ties
    de Zoysa, Isabelle
    LANCET, 2011, 378 (9803): : 1610 - 1612
  • [23] Researches on Evaluations of Large-scale Complex Networks Topologies
    Xu Ye
    Chong Fei
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 107 : 577 - 583
  • [24] Memory efficient large-scale image-based localization
    Guoyu Lu
    Nicu Sebe
    Congfu Xu
    Chandra Kambhamettu
    Multimedia Tools and Applications, 2015, 74 : 479 - 503
  • [25] EFFECTS OF LARGE-SCALE EVALUATIONS ON SCHOOL CURRICULUM ORGANIZATION
    Ferreira, Livia Andrade
    Ferraz Pereira, Maria Simone
    NUANCES-ESTUDOS SOBRE EDUCACAO, 2019, 30 (01): : 327 - 344
  • [26] Large-scale evaluations as a biopolitical and disciplinary power device
    Tedeschi, Sirley Lizott
    Pavan, Ruth
    EDUCACAO, 2020, 45
  • [27] Large-Scale Image Retrieval Based on Compressed Camera Identification
    Valsesia, Diego
    Coluccia, Giulio
    Bianchi, Tiziano
    Magli, Enrico
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (09) : 1439 - 1449
  • [28] Large-Scale Image Retrieval Method Based on Vocabulary Tree
    Qi Jin
    Zhao Jian
    Xie Yu
    Chen Xiao-ning
    12TH ANNUAL MEETING OF CHINA ASSOCIATION FOR SCIENCE AND TECHNOLOGY ON INFORMATION AND COMMUNICATION TECHNOLOGY AND SMART GRID, 2010, : 219 - 223
  • [29] Very Large-Scale Image Retrieval Based on Local Features
    Yin, Chang-Qing
    Mao, Wei
    Jiang, Wei
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 242 - +
  • [30] COMPACT FEATURE BASED CLUSTERING FOR LARGE-SCALE IMAGE RETRIEVAL
    Liang, Yan
    Dong, Le
    Xie, Shanshan
    Lv, Na
    Xu, Zongyi
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,