Infrared Image Caption Based on Object-Oriented Attention

被引:3
|
作者
Lv, Junfeng [1 ]
Hui, Tian [1 ]
Zhi, Yongfeng [1 ]
Xu, Yuelei [1 ]
机构
[1] Northwestern Polytech Univ, Inst Unmanned Syst Res, Xian 710072, Peoples R China
关键词
infrared image caption; domain transfer object detection; adaptive weighting module; object oriented attention;
D O I
10.3390/e25050826
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
With the ongoing development of image technology, the deployment of various intelligent applications on embedded devices has attracted increased attention in the industry. One such application is automatic image captioning for infrared images, which involves converting images into text. This practical task is widely used in night security, as well as for understanding night scenes and other scenarios. However, due to the differences in image features and the complexity of semantic information, generating captions for infrared images remains a challenging task. From the perspective of deployment and application, to improve the correlation between descriptions and objects, we introduced the YOLOv6 and LSTM as encoder-decoder structure and proposed infrared image caption based on object-oriented attention. Firstly, to improve the domain adaptability of the detector, we optimized the pseudo-label learning process. Secondly, we proposed the object-oriented attention method to address the alignment problem between complex semantic information and embedded words. This method helps select the most crucial features of the object region and guides the caption model in generating words that are more relevant to the object. Our methods have shown good performance on the infrared image and can produce words explicitly associated with the object regions located by the detector. The robustness and effectiveness of the proposed methods were demonstrated through evaluation on various datasets, along with other state-of-the-art methods. Our approach achieved BLUE-4 scores of 31.6 and 41.2 on KAIST and Infrared City and Town datasets, respectively. Our approach provides a feasible solution for the deployment of embedded devices in industrial applications.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Object-oriented modelling based on logbooks
    vanBommel, P
    Frederiks, PJM
    vanderWeide, TP
    COMPUTER JOURNAL, 1996, 39 (09): : 793 - 799
  • [32] The design of an object-oriented user interface for the object-oriented database
    Liu, XD
    Li, LZ
    Wang, XF
    OBJECT-ORIENTED TECHNOLOGY, 1997, : 150 - 155
  • [33] Research for image caption based on global attention mechanism
    Tong, Wu
    Tao, Ku
    Hao, Zhang
    SECOND TARGET RECOGNITION AND ARTIFICIAL INTELLIGENCE SUMMIT FORUM, 2020, 11427
  • [34] OBJECT-ORIENTED REQUIREMENTS TO OBJECT-ORIENTED DESIGN - AN EASY TRANSITION
    DAVIS, AM
    JOURNAL OF SYSTEMS AND SOFTWARE, 1995, 30 (1-2) : 151 - 159
  • [35] Research on Object-oriented Remote Sensing Image Classification
    Ma, Yongli
    Zeng, Xuan
    Yu, Gangyong
    2015 4TH INTERNATIONAL CONFERENCE ON ENERGY AND ENVIRONMENTAL PROTECTION (ICEEP 2015), 2015, : 1993 - 1997
  • [36] OBJECT-ORIENTED BACKDOOR ATTACK AGAINST IMAGE CAPTIONING
    Li, Meiling
    Zhong, Nan
    Zhang, Xinpeng
    Qian, Zhenxing
    Li, Sheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2864 - 2868
  • [37] Object-Oriented Disability: The Prosthetic Image in Paradise Lost
    Swarbrick, Steven
    JNT-JOURNAL OF NARRATIVE THEORY, 2019, 49 (03): : 323 - 350
  • [39] Object-oriented land cover image classification system
    Yongsheng Y.
    Qingrui C.
    Jing X.
    Recent Patents on Engineering, 2010, 4 (01) : 56 - 62
  • [40] A novel object-oriented approach to image analysis and retrieval
    Metzler, V
    Aach, T
    Thies, C
    FIFTH IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION, PROCEEDINGS, 2002, : 14 - 18