Neural Caption Generation for News Images

Cited by: 0

Authors:

Batra, Vishwash [1]

He, Yulan [1]

Vogiatzis, George [1]

Affiliations:

[1] Aston Univ, Sch Engn & Appl Sci, Birmingham, W Midlands, England

Source:

PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018

Keywords:

Recurrent Neural Networks; Image caption generation; Deep learning; Order Embedding;

DOI:

Not available

Chinese Library Classification number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Automatic caption generation for images has gained significant interest. It gives rise to many interesting image-related applications. For example, it could help in image/video retrieval and in managing the vast amount of multimedia data available on the Internet. It could also help in the development of tools that aid visually impaired individuals in accessing multimedia content. In this paper, we focus on news images and propose a methodology for automatically generating captions for newspaper articles consisting of a text paragraph and an image. We propose several deep neural network architectures built upon Recurrent Neural Networks. Results on a BBC News dataset show that our proposed approach outperforms a traditional method based on Latent Dirichlet Allocation, in both automatic evaluation based on BLEU scores and human evaluation.


Pages: 1726-1733

Number of pages: 8
