Quality Estimation for Image Captions Based on Large-scale Human Evaluations

被引：0

作者：

Levinboim, Tomer ^{[1
]}

Thapliyal, Ashish V. ^{[1
]}

Sharma, Piyush ^{[1
]}

Soricut, Radu ^{[1
]}

机构：

[1] Google Res, Venice, CA 90291 USA

来源：

2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021) | 2021年

关键词：

LANGUAGE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic image captioning has improved significantly over the last few years, but the problem is far from being solved, with state of the art models still often producing low quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model the caption quality from a human perspective and without access to ground-truth references, so that it can be applied at prediction time to detect low-quality captions produced on previously unseen images. For this task, we develop a human evaluation process that collects coarse-grained caption annotations from crowdsourced users, which is then used to collect a large scale dataset spanning more than 600k caption quality ratings. We then carefully validate the quality of the collected ratings and establish baseline models for this new QE task. Finally, we further collect fine-grained caption quality annotations from trained raters, and use them to demonstrate that QE models trained over the coarse ratings can effectively detect and filter out low-quality image captions, thereby improving the user experience from captioning systems.

引用

页码：3157 / 3166

页数：10

共 50 条

[21] When should we conduct large-scale evaluations?
Holmes, John
ADDICTION, 2023, 118 (09) : 1622 - 1623
[22] Beyond accountability: learning from large-scale evaluations
Boerma, Ties
de Zoysa, Isabelle
LANCET, 2011, 378 (9803): : 1610 - 1612
[23] Researches on Evaluations of Large-scale Complex Networks Topologies
Xu Ye
Chong Fei
ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 107 : 577 - 583
[24] Memory efficient large-scale image-based localization
Guoyu Lu
Nicu Sebe
Congfu Xu
Chandra Kambhamettu
Multimedia Tools and Applications, 2015, 74 : 479 - 503
[25] EFFECTS OF LARGE-SCALE EVALUATIONS ON SCHOOL CURRICULUM ORGANIZATION
Ferreira, Livia Andrade
Ferraz Pereira, Maria Simone
NUANCES-ESTUDOS SOBRE EDUCACAO, 2019, 30 (01): : 327 - 344
[26] Large-scale evaluations as a biopolitical and disciplinary power device
Tedeschi, Sirley Lizott
Pavan, Ruth
EDUCACAO, 2020, 45
[27] Large-Scale Image Retrieval Based on Compressed Camera Identification
Valsesia, Diego
Coluccia, Giulio
Bianchi, Tiziano
Magli, Enrico
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (09) : 1439 - 1449
[28] Large-Scale Image Retrieval Method Based on Vocabulary Tree
Qi Jin
Zhao Jian
Xie Yu
Chen Xiao-ning
12TH ANNUAL MEETING OF CHINA ASSOCIATION FOR SCIENCE AND TECHNOLOGY ON INFORMATION AND COMMUNICATION TECHNOLOGY AND SMART GRID, 2010, : 219 - 223
[29] Very Large-Scale Image Retrieval Based on Local Features
Yin, Chang-Qing
Mao, Wei
Jiang, Wei
EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 242 - +
[30] COMPACT FEATURE BASED CLUSTERING FOR LARGE-SCALE IMAGE RETRIEVAL
Liang, Yan
Dong, Le
Xie, Shanshan
Lv, Na
Xu, Zongyi
2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,

← 1 2 3 4 5 →