Domain-specific image captioning: a comprehensive review

被引:1
|
作者
Sharma, Himanshu [1 ]
Padha, Devanand [1 ]
机构
[1] Cent Univ Jammu, Dept Comp Sci & Informat Technol, Jammu 181124, Jammu & Kashmir, India
关键词
Computer vision; Deep learning; Medical image captioning; Natural image captioning; Remote sensing image captioning; AUTOMATIC IMAGE; GENERATION; MODELS; RETRIEVAL; SPEECH;
D O I
10.1007/s13735-024-00328-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An image caption is a sentence summarizing the semantic details of an image. It is a blended application of computer vision and natural language processing. The earlier research addressed this domain using machine learning approaches by modeling image captioning frameworks using hand-engineered feature extraction techniques. With the resurgence of deep-learning approaches, the development of improved and efficient image captioning frameworks is on the rise. Image captioning is witnessing tremendous growth in various domains as medical, remote sensing, security, visual assistance, and multimodal search engines. In this survey, we comprehensively study the image captioning frameworks based on our proposed domain-specific taxonomy. We explore the benchmark datasets and metrics leveraged for training and evaluating image captioning models in various application domains. In addition, we also perform a comparative analysis of the reviewed models. Natural image captioning, medical image captioning, and remote sensing image captioning are currently among the most prominent application domains of image captioning. The efficacy of real-time image captioning is a challenging obstacle limiting its implementation in sensitive areas such as visual aid, remote security, and healthcare. Further challenges include the scarcity of rich domain-specific datasets, training complexity, evaluation difficulty, and a deficiency of cross-domain knowledge transfer techniques. Despite the significant contributions made, there is a need for additional efforts to develop steadfast and influential image captioning models.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] A Comprehensive Web-based Platform For Domain-Specific Biological Models
    Klement, M.
    Safranek, D.
    Ded, T.
    Pejznoch, A.
    Nedbal, L.
    Steuer, R.
    Cerveny, J.
    Mueller, S.
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 299 : 61 - 67
  • [22] Untangling Crosscutting Concerns in Domain-specific Languages with Domain-specific Join Points
    Dinkelaker, Tom
    Monperrus, Martin
    Mezini, Mira
    DSAL09: DOMAIN-SPECIFIC ASPECT LANGUAGES, 2009, : 1 - 5
  • [23] Domain-specific feature elimination: multi-source domain adaptation for image classification
    Wu, Kunhong
    Jia, Fan
    Han, Yahong
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
  • [24] Common Dictionary and Domain-Specific Dictionary based Cross-Domain Image Classification
    Zhang, Kangkang
    Yuan, Meigui
    Xiong, Youling
    Qu, Lei
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 2824 - 2829
  • [25] Domain-specific feature elimination:multi-source domain adaptation for image classification
    Kunhong WU
    Fan JIA
    Yahong HAN
    Frontiers of Computer Science, 2023, 17 (04) : 168 - 176
  • [26] Unsupervised Image-to-Image Translation Using Domain-Specific Variational Information Bound
    Kazemi, Hadi
    Soleymani, Sobhan
    Taherkhani, Fariborz
    Iranmanesh, Seyed Mehdi
    Nasrabadi, Nasser M.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [27] Generate domain-specific sentiment lexicon for review sentiment analysis
    Hongyu Han
    Jianpei Zhang
    Jing Yang
    Yiran Shen
    Yongshi Zhang
    Multimedia Tools and Applications, 2018, 77 : 21265 - 21280
  • [28] Generate domain-specific sentiment lexicon for review sentiment analysis
    Han, Hongyu
    Zhang, Jianpei
    Yang, Jing
    Shen, Yiran
    Zhang, Yongshi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (16) : 21265 - 21280
  • [29] Halide and GENESIS for Generating Domain-Specific Architecture of Guided Image Filtering
    Ishikawa, Akari
    Fukushima, Norishige
    Maruoka, Akira
    Iizuka, Takuro
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [30] A selected review of the literature on development of learners' domain-specific knowledge
    Dodds, P
    Griffin, LL
    Placek, JH
    JOURNAL OF TEACHING IN PHYSICAL EDUCATION, 2001, 20 (04) : 301 - 313