Searching for memory-lighter architectures for OCR-augmented image captioning

被引:0
|
作者
Gallardo-García, Rafael [1 ]
Beltrán-Martínez, Beatriz [1 ]
Hernández-Gracidas, Carlos [2 ]
Vilariño-Ayala, Darnes [1 ]
机构
[1] Language and Knowledge Engineering Laboratory, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
[2] Faculty of Physical and Mathematical Sciences, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
来源
关键词
Memory architecture;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:4399 / 4410
相关论文
共 8 条
  • [1] Searching for memory-lighter architectures for OCR-augmented image captioning
    Gallardo-Garcia, Rafael
    Beltran-Martinez, Beatriz
    Hernandez-Gracidas, Carlos
    Vilarino-Aya, Darnes
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4399 - 4410
  • [2] Image and Video Captioning with Augmented Neural Architectures
    Shetty, Rakshith
    Tavakoli, Hamed R.
    Laaksonen, Jorma
    IEEE MULTIMEDIA, 2018, 25 (02) : 34 - 46
  • [3] Memory-Augmented Image Captioning
    Fei, Zhengcong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1317 - 1324
  • [4] Towards Retrieval-Augmented Architectures for Image Captioning
    Sarto, Sara
    Cornia, Marcella
    Baraldi, Lorenzo
    Nicolosi, Alessandro
    Cucchiara, Rita
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08)
  • [5] MeaCap: Memory-Augmented Zero-shot Image Captioning
    Zeng, Zequn
    Xie, Yan
    Zhang, Hao
    Chen, Chiyu
    Chen, Bo
    Wang, Zhengjue
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 14100 - 14110
  • [6] MIRA-CAP: Memory-Integrated Retrieval-Augmented Captioning for State-of-the-Art Image and Video Captioning
    Umirzakova, Sabina
    Muksimova, Shakhnoza
    Mardieva, Sevara
    Baxtiyarovich, Murodjon Sultanov
    Cho, Young-Im
    SENSORS, 2024, 24 (24)
  • [7] Retrieval-enhanced adversarial training with dynamic memory-augmented attention for image paragraph captioning
    Xu, Chunpu
    Yang, Min
    Ao, Xiang
    Shen, Ying
    Xu, Ruifeng
    Tian, Jinwen
    KNOWLEDGE-BASED SYSTEMS, 2021, 214
  • [8] From grids to pseudo-regions: Dynamic memory augmented image captioning with dual relation transformer
    Zhou, Wei
    Jiang, Weitao
    Zheng, Zhijie
    Li, Jianchao
    Su, Tao
    Hu, Haifeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273