Quality Estimation for Image Captions Based on Large-scale Human Evaluations

Cited by: 0
Authors
Levinboim, Tomer [1]
Thapliyal, Ashish V. [1]
Sharma, Piyush [1]
Soricut, Radu [1]
Affiliations
[1] Google Res, Venice, CA 90291 USA
Keywords
LANGUAGE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic image captioning has improved significantly over the last few years, but the problem is far from being solved, with state-of-the-art models still often producing low-quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model the caption quality from a human perspective and without access to ground-truth references, so that it can be applied at prediction time to detect low-quality captions produced on previously unseen images. For this task, we develop a human evaluation process that collects coarse-grained caption annotations from crowdsourced users, which is then used to collect a large-scale dataset spanning more than 600k caption quality ratings. We then carefully validate the quality of the collected ratings and establish baseline models for this new QE task. Finally, we further collect fine-grained caption quality annotations from trained raters, and use them to demonstrate that QE models trained over the coarse ratings can effectively detect and filter out low-quality image captions, thereby improving the user experience from captioning systems.
Pages: 3157-3166
Number of pages: 10
Related papers
50 in total
  • [1] STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset
    Yoshikawa, Yuya
    Shigeto, Yutaro
    Takeuchi, Akikazu
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 417 - 421
  • [2] Absolute pose estimation of UAV based on large-scale satellite image
    Wang, Hanyu
    Shen, Qiang
    Deng, Zilong
    Cao, Xinyi
    Wang, Xiaokang
    CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (06) : 219 - 231
  • [3] Absolute pose estimation of UAV based on large-scale satellite image
    Hanyu WANG
    Qiang SHEN
    Zilong DENG
    Xinyi CAO
    Xiaokang Wang
    Chinese Journal of Aeronautics, 2024, 37 (06) : 219 - 231
  • [4] A Method for Large-Scale IPTV Quality Estimation
    Qiu, Xiao-tong
    Huang, Li-sheng
    Jiang, Wen-jie
    Xian, Ming
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNIQUES AND APPLICATIONS, AITA 2016, 2016, : 258 - 265
  • [5] MANNET: A LARGE-SCALE MANIPULATED IMAGE DETECTION DATASET AND BASELINE EVALUATIONS
    Singh, Aditya
    Chhabra, Saheb
    Majumdar, Puspita
    Singh, Richa
    Vatsa, Mayank
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1780 - 1784
  • [6] Labeling Quality Problem for Large-Scale Image Recognition
    Pilch, Agnieszka
    Maciejewski, Henryk
    NEW ADVANCES IN DEPENDABILITY OF NETWORKS AND SYSTEMS, DEPCOS-RELCOMEX 2022, 2022, 484 : 206 - 216
  • [7] A Large-Scale Database of Images and Captions for Automatic Face Naming
    Oezcan, Mert
    Jie, Luo
    Ferrari, Vittorio
    Caputo, Barbara
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [8] Large-scale image classification and nutrient estimation for Chinese dishes
    Feng, Yihang
    Wang, Yi
    Wang, Xinhao
    Bi, Jinbo
    Xiao, Zhenlei
    Luo, Yangchao
    JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2025, 19
  • [9] Large-Scale Crowdsourcing Subjective Quality Evaluation of Learning-Based Image Coding
    Upenik, Evgeniy
    Testolina, Michela
    Ascenso, Joao
    Pereira, Fernando
    Ebrahimi, Touradj
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [10] An high quality image scaling engine for large-scale LCD
    Xiang, Zuquan
    Zou, Xuecheng
    Liu, Zhenglin
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 621 - +