IMAGE CAPTIONING WITH ATTRIBUTE REFINEMENT

被引：0

作者：

Huang, Yiqing ^{[1
]}

Li, Cong ^{[1
]}

Li, Tianpeng ^{[1
]}

Wan, Weitao ^{[1
]}

Chen, Jiansheng ^{[1
]}

机构：

[1] Tsinghua Univ, Beijing 100084, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年

基金：

中国国家自然科学基金;

关键词：

Image captioning; attribute recognition; Semantic attention; Deep Neural Network; Conditional Random Field;

D O I：

10.1109/icip.2019.8803108

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Semantic attention has long been adopted to image captioning models to enhance the image captioning performances. The models pre-trained for attribute recognition are utilized to generate image attributes in image captioning. Generally, these models are not jointly trained with image captioning models. In this paper, we propose attribute refinement network, which incorporates attribute recognition with image captioning to boost the performance on both tasks. We model the correlation between attributes with the semantic information from image captioning to improve the recognition accuracy. In turn, better attribute recognition results effectively enhance image captioning performance. Our model achieves CIDEr-D/SPICE scores of 115.1 and 20.9 respectively on the MS COCO test set, comprehensively yields improvement over all compared methods.

引用

页码：1820 / 1824

页数：5

共 50 条

[41] Learning to Evaluate Image Captioning
Cui, Yin
Yang, Guandao
Veit, Andreas
Huang, Xun
Belongie, Serge
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5804 - 5812
[42] Meta Learning for Image Captioning
Li, Nannan
Chen, Zhenzhong
Liu, Shan
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8626 - 8633
[43] Image Captioning in Turkish Language
Yilmaz, Berk Dursun
Demir, Ali Emre
Sonmez, Elena Battini
Yildiz, Tugba
2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 413 - 417
[44] Image Captioning with Memorized Knowledge
Hui Chen
Guiguang Ding
Zijia Lin
Yuchen Guo
Caifeng Shan
Jungong Han
Cognitive Computation, 2021, 13 : 807 - 820
[45] Rich Image Captioning in the Wild
Tran, Kenneth
He, Xiaodong
Zhang, Lei
Sun, Jian
Carapcea, Cornelia
Thrasher, Chris
Buehler, Chris
Sienkiewicz, Chris
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 434 - 441
[46] Aesthetically Relevant Image Captioning
Zhong, Zhipeng
Zhou, Fei
Qiu, Guoping
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3733 - 3741
[47] Deep Image Captioning: An Overview
Hrga, I.
Ivasic-Kos, M.
2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 995 - 1000
[48] Image Captioning by Asking Questions
Yang, Xiaoshan
Xu, Changsheng
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
[49] Boosted Transformer for Image Captioning
Li, Jiangyun
Yao, Peng
Guo, Longteng
Zhang, Weicun
APPLIED SCIENCES-BASEL, 2019, 9 (16):
[50] Image Captioning with Semantic Attention
You, Quanzeng
Jin, Hailin
Wang, Zhaowen
Fang, Chen
Luo, Jiebo
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4651 - 4659

← 1 2 3 4 5 →