Automatic Generation of Description for Images Using Recurrent Neural Network

Cited by: 0
Authors
Veena, G. S. [1]
Patil, Savitri [1]
Kumar, T. N. R. [1]
Affiliations
[1] Ramaiah Inst Technol, Bengaluru, India
Source
Keywords
Artificial intelligence; Consciousness; Deep learning; Intelligent agent; Long short-term memory; InceptionV3; Flickr30k; Bilingual evaluation understudy;
DOI
10.1007/978-981-13-7150-9_44
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
This paper implements a generative long short-term memory (LSTM) model that produces a description of an image. Automatically generating a description of the content of a given image is a fundamental problem in artificial intelligence, and it requires connecting two domains: computer vision and natural language processing. The proposed solution uses deep learning, implemented in the Keras framework with TensorFlow as the backend. TensorFlow is a framework that executes a series of operations chained together. The general technique is to feed the features of an image to the model, which generates text no longer than a predefined caption length. The model is trained on the Flickr30k dataset, and InceptionV3 is used to extract the image features. The BLEU metric is used to measure the accuracy of the descriptions that the LSTM model generates.
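The length-capped generation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `<start>`/`<end>` sentinel tokens, the greedy word choice, and the toy predictor standing in for a trained LSTM decoder are all assumptions made for illustration. The 2048-value feature vector assumes features are taken from InceptionV3's pooled output, which the abstract does not specify.

```python
from typing import Callable, Dict, List, Sequence

START, END = "<start>", "<end>"  # hypothetical sentinel tokens


def greedy_caption(
    image_features: Sequence[float],
    next_word_probs: Callable[[Sequence[float], List[str]], Dict[str, float]],
    max_caption_length: int,
) -> List[str]:
    """Generate at most max_caption_length words, one word per step."""
    words: List[str] = [START]
    for _ in range(max_caption_length):
        # Ask the decoder for a distribution over the next word,
        # conditioned on the image features and the words so far.
        probs = next_word_probs(image_features, words)
        best = max(probs, key=probs.get)
        if best == END:  # the decoder signals that the caption is complete
            break
        words.append(best)
    return words[1:]  # drop the start token


def toy_next_word_probs(image_features, words_so_far):
    """Stand-in for a trained LSTM decoder: replays a fixed caption."""
    script = ["a", "dog", "runs", "on", "grass", END]
    step = len(words_so_far) - 1  # number of words generated so far
    target = script[min(step, len(script) - 1)]
    return {w: (0.9 if w == target else 0.02) for w in set(script)}


# Placeholder features; assumed to come from InceptionV3's pooled output.
features = [0.0] * 2048

full_caption = greedy_caption(features, toy_next_word_probs, max_caption_length=10)
short_caption = greedy_caption(features, toy_next_word_probs, max_caption_length=3)
```

With a cap of 10 the toy decoder reaches its end token and the loop stops early; with a cap of 3 the caption is cut off after three words, which is the "less than or equal to a predefined caption length" behavior the abstract describes.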
Pages: 11