Quality Estimation for Image Captions Based on Large-scale Human Evaluations

Cited by: 0
Authors
Levinboim, Tomer [1]
Thapliyal, Ashish V. [1]
Sharma, Piyush [1]
Soricut, Radu [1]
Affiliations
[1] Google Res, Venice, CA 90291 USA
Keywords
LANGUAGE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic image captioning has improved significantly over the last few years, but the problem is far from solved: state-of-the-art models still often produce low-quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model caption quality from a human perspective and without access to ground-truth references, so that it can be applied at prediction time to detect low-quality captions produced on previously unseen images. For this task, we develop a human evaluation process that collects coarse-grained caption annotations from crowdsourced users, and we use it to collect a large-scale dataset spanning more than 600k caption quality ratings. We then carefully validate the quality of the collected ratings and establish baseline models for this new QE task. Finally, we collect fine-grained caption quality annotations from trained raters and use them to demonstrate that QE models trained on the coarse ratings can effectively detect and filter out low-quality image captions, thereby improving the user experience of captioning systems.
Pages: 3157-3166
Number of pages: 10