Quality Estimation for Image Captions Based on Large-scale Human Evaluations

被引：0

作者：

Levinboim, Tomer ^{[1
]}

Thapliyal, Ashish V. ^{[1
]}

Sharma, Piyush ^{[1
]}

Soricut, Radu ^{[1
]}

机构：

[1] Google Res, Venice, CA 90291 USA

来源：

2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021) | 2021年

关键词：

LANGUAGE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic image captioning has improved significantly over the last few years, but the problem is far from being solved, with state of the art models still often producing low quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model the caption quality from a human perspective and without access to ground-truth references, so that it can be applied at prediction time to detect low-quality captions produced on previously unseen images. For this task, we develop a human evaluation process that collects coarse-grained caption annotations from crowdsourced users, which is then used to collect a large scale dataset spanning more than 600k caption quality ratings. We then carefully validate the quality of the collected ratings and establish baseline models for this new QE task. Finally, we further collect fine-grained caption quality annotations from trained raters, and use them to demonstrate that QE models trained over the coarse ratings can effectively detect and filter out low-quality image captions, thereby improving the user experience from captioning systems.

引用

页码：3157 / 3166

页数：10

共 50 条

[1] STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset
Yoshikawa, Yuya
Shigeto, Yutaro
Takeuchi, Akikazu
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 417 - 421
[2] Absolute pose estimation of UAV based on large-scale satellite image
Wang, Hanyu
Shen, Qiang
Deng, Zilong
Cao, Xinyi
Wang, Xiaokang
CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (06) : 219 - 231
[3] Absolute pose estimation of UAV based on large-scale satellite image
Hanyu WANG
Qiang SHEN
Zilong DENG
Xinyi CAO
Xiaokang Wang
Chinese Journal of Aeronautics, 2024, 37 (06) : 219 - 231
[4] A Method for Large-Scale IPTV Quality Estimation
Qiu, Xiao-tong
Huang, Li-sheng
Jiang, Wen-jie
Xian, Ming
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNIQUES AND APPLICATIONS, AITA 2016, 2016, : 258 - 265
[5] MANNET: A LARGE-SCALE MANIPULATED IMAGE DETECTION DATASET AND BASELINE EVALUATIONS
Singh, Aditya
Chhabra, Saheb
Majumdar, Puspita
Singh, Richa
Vatsa, Mayank
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1780 - 1784
[6] Labeling Quality Problem for Large-Scale Image Recognition
Pilch, Agnieszka
Maciejewski, Henryk
NEW ADVANCES IN DEPENDABILITY OF NETWORKS AND SYSTEMS, DEPCOS-RELCOMEX 2022, 2022, 484 : 206 - 216
[7] A Large-Scale Database of Images and Captions for Automatic Face Naming
Oezcan, Mert
Jie, Luo
Ferrari, Vittorio
Caputo, Barbara
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[8] Large-scale image classification and nutrient estimation for Chinese dishes
Feng, Yihang
Wang, Yi
Wang, Xinhao
Bi, Jinbo
Xiao, Zhenlei
Luo, Yangchao
JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2025, 19
[9] Large-Scale Crowdsourcing Subjective Quality Evaluation of Learning-Based Image Coding
Upenik, Evgeniy
Testolina, Michela
Ascenso, Joao
Pereira, Fernando
Ebrahimi, Touradj
2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
[10] An high quality image scaling engine for large-scale LCD
Xiang, Zuquan
Zou, Xuecheng
Liu, Zhenglin
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 621 - +

← 1 2 3 4 5 →