Neural Caption Generation for News Images

Cited by: 0
Authors
Batra, Vishwash [1]
He, Yulan [1]
Vogiatzis, George [1]
Affiliations
[1] Aston Univ, Sch Engn & Appl Sci, Birmingham, W Midlands, England
Keywords
Recurrent Neural Networks; Image caption generation; Deep learning; Order Embedding
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Automatic caption generation for images has gained significant interest and gives rise to many image-related applications. For example, it could help with image and video retrieval and with managing the vast amount of multimedia data available on the Internet. It could also support the development of tools that help visually impaired individuals access multimedia content. In this paper, we focus on news images and propose a methodology for automatically generating captions for newspaper articles consisting of a text paragraph and an image. We propose several deep neural network architectures built upon Recurrent Neural Networks. Results on a BBC News dataset show that our proposed approach outperforms a traditional method based on Latent Dirichlet Allocation under both automatic evaluation based on BLEU scores and human evaluation.
Pages: 1726 - 1733
Page count: 8
Related papers
50 in total
  • [31] A Method of Caption Location and Segmentation in News Video
    Huang, He
    Shi, Ping
    Yang, Laiwen
    2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014), 2014, : 365 - 369
  • [32] Generative Caption for Diabetic Retinopathy Images
    Wu, Luhui
    Wan, Cheng
    Wu, Yiquan
    Liu, Jiang
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 515 - 519
  • [34] Novel model to integrate word embeddings and syntactic trees for automatic caption generation from images
    Zhang, Hongbin
    Qiu, Diedie
    Wu, Renzhong
    Ji, Donghong
    Li, Guangli
    Niu, Zhenyu
    Li, Tao
    SOFT COMPUTING, 2020, 24 (02) : 1377 - 1397
  • [35] An improved algorithm of news video caption detection and recognition
    Yang, Qiang
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1549 - 1552
  • [37] Image Caption Generation With Adaptive Transformer
    Zhang, Wei
    Nie, Wenbo
    Li, Xinle
    Yu, Yao
    2019 34RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2019, : 521 - 526
  • [38] Caption Text Location with Combined Features for News Videos
    Su, Yuting
    Ji, Zhong
    Song, Xingguang
    Hua, Rui
    2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 714 - 718
  • [39] A Multimodal Framework for Video Caption Generation
    Bhooshan, Reshmi S.
    Suresh, K.
    IEEE ACCESS, 2022, 10 : 92166 - 92176
  • [40] An Overview of Image Caption Generation Methods
    Wang, Haoran
    Zhang, Yue
    Yu, Xiaosheng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2020, 2020