Quality Estimation for Image Captions Based on Large-scale Human Evaluations

被引：0

作者：

Levinboim, Tomer ^{[1
]}

Thapliyal, Ashish V. ^{[1
]}

Sharma, Piyush ^{[1
]}

Soricut, Radu ^{[1
]}

机构：

[1] Google Res, Venice, CA 90291 USA

来源：

2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021) | 2021年

关键词：

LANGUAGE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic image captioning has improved significantly over the last few years, but the problem is far from being solved, with state of the art models still often producing low quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model the caption quality from a human perspective and without access to ground-truth references, so that it can be applied at prediction time to detect low-quality captions produced on previously unseen images. For this task, we develop a human evaluation process that collects coarse-grained caption annotations from crowdsourced users, which is then used to collect a large scale dataset spanning more than 600k caption quality ratings. We then carefully validate the quality of the collected ratings and establish baseline models for this new QE task. Finally, we further collect fine-grained caption quality annotations from trained raters, and use them to demonstrate that QE models trained over the coarse ratings can effectively detect and filter out low-quality image captions, thereby improving the user experience from captioning systems.

引用

页码：3157 / 3166

页数：10

共 50 条

[31] Memory efficient large-scale image-based localization
Lu, Guoyu
Sebe, Nicu
Xu, Congfu
Kambhamettu, Chandra
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 479 - 503
[32] Estimation of DoA Based on Large-scale Virtual Array Data
Hung-Anh Nguyen
Mahler, Kim
Peter, Michael
Keusgen, Wilhelm
Eichler, Taro
Mellein, Heinz
2016 10TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP), 2016,
[33] Large-scale near-duplicate image retrieval by kernel density estimation
Wei Tong
Fengjie Li
Rong Jin
Anil Jain
International Journal of Multimedia Information Retrieval, 2012, 1 (1) : 45 - 58
[34] Large-scale near-duplicate image retrieval by kernel density estimation
Tong, Wei
Li, Fengjie
Jin, Rong
Jain, Anil
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2012, 1 (01) : 45 - 58
[35] A Novel Image Retrieval Method for Image Based Localization in Large-Scale Environment
Yin, Xiliang
Ma, Lin
Tan, Xuezhi
2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2021,
[36] LARGE-SCALE IMAGE-PROCESSING
CHEN, CC
BULLETIN OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1987, 13 (06): : 15 - 16
[37] Large-scale methods in image deblurring
Hansen, Per Christian
Jensen, Toke Koldborg
APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2007, 4699 : 24 - +
[38] Large-Scale Image Retrieval with Elasticsearch
Amato, Giuseppe
Bolettieri, Paolo
Carrara, Fabio
Falchi, Fabrizio
Gennaro, Claudio
ACM/SIGIR PROCEEDINGS 2018, 2018, : 925 - 928
[39] Large-Scale Ideal Point Estimation
Peress, Michael
POLITICAL ANALYSIS, 2022, 30 (03) : 346 - 363
[40] COMPRESSIVE LARGE-SCALE IMAGE SENSING
Liang, Wei-Jie
Lin, Gang-Xuan
Lu, Chun-Shien
2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 378 - 382

← 1 2 3 4 5 →