Automatic Caption Generation for News Images

被引:61
|
作者
Feng, Yansong [1 ]
Lapata, Mirella [2 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, 128 Zhong Guan Cun N St, Beijing 100871, Peoples R China
[2] Univ Edinburgh, Informat Forum, Inst Language Cognit & Computat, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
关键词
Caption generation; image annotation; summarization; topic models; NATURAL-LANGUAGE;
D O I
10.1109/TPAMI.2012.118
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with the task of automatically generating captions for images, which is important for many image-related applications. Examples include video and image retrieval as well as the development of tools that aid visually impaired individuals to access pictorial information. Our approach leverages the vast resource of pictures available on the web and the fact that many of them are captioned and colocated with thematically related documents. Our model learns to create captions from a database of news articles, the pictures embedded in them, and their captions, and consists of two stages. Content selection identifies what the image and accompanying article are about, whereas surface realization determines how to verbalize the chosen content. We approximate content selection with a probabilistic image annotation model that suggests keywords for an image. The model postulates that images and their textual descriptions are generated by a shared set of latent variables (topics) and is trained on a weakly labeled dataset (which treats the captions and associated news articles as image labels). Inspired by recent work in summarization, we propose extractive and abstractive surface realization models. Experimental results show that it is viable to generate captions that are pertinent to the specific content of an image and its associated article, while permitting creativity in the description. Indeed, the output of our abstractive model compares favorably to handwritten captions and is often superior to extractive methods.
引用
收藏
页码:797 / 812
页数:16
相关论文
共 50 条
  • [1] Neural Caption Generation for News Images
    Batra, Vishwash
    He, Yulan
    Vogiatzis, George
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1726 - 1733
  • [2] Automatic Caption Generation for Medical Images
    Allaouzi, Imane
    Ben Ahmed, M.
    Benamrou, B.
    Ouardouz, M.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA'18), 2018,
  • [3] How Many Words is a Picture Worth? Automatic Caption Generation for News Images
    Feng, Yansong
    Lapata, Mirella
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 1239 - 1249
  • [4] GENERATION OF CAPTION SELECTION FOR NEWS IMAGES USING STEMMING ALGORITHM
    Vijay, K.
    Ramya, D.
    2015 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY, INFORMATION AND COMMUNICATION (ICCPEIC), 2015, : 536 - 540
  • [5] Automatic Caption Generation for annotated images by using clustering algorithm
    Reddy, A. Sivakrishna
    Monolisa, N.
    Nathiya, M.
    Anjugam, D.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [6] A survey on automatic image caption generation
    Bai, Shuang
    An, Shan
    NEUROCOMPUTING, 2018, 311 : 291 - 304
  • [7] A novel framework for automatic caption and audio generation
    Kulkarni, Chaitanya
    Monika, P.
    Preeti, B.
    Shruthi, S.
    MATERIALS TODAY-PROCEEDINGS, 2022, 65 : 3248 - 3252
  • [8] AutoCaption: Automatic Caption Generation for Personal Photos
    Ramnath, Krishnan
    Baker, Simon
    Vanderwende, Lucy
    El-Saban, Motaz
    Sinha, Sudipta N.
    Kannan, Anitha
    Hassan, Noran
    Galley, Michel
    Yang, Yi
    Ramanan, Deva
    Bergamo, Alessandro
    Torresani, Lorenzo
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 1050 - 1057
  • [9] Novel model to integrate word embeddings and syntactic trees for automatic caption generation from images
    Hongbin Zhang
    Diedie Qiu
    Renzhong Wu
    Donghong Ji
    Guangli Li
    Zhenyu Niu
    Tao Li
    Soft Computing, 2020, 24 : 1377 - 1397
  • [10] Transformer based image caption generation for news articles
    Pande, Ashtavinayak
    Pandey, Atul
    Solanki, Ayush
    Shanbhag, Chinmay
    Motghare, Manish
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01):