A Joint Model for Text and Image Semantic Feature Extraction

被引:8
|
作者
Cao, Jiarun [1 ]
Wang, Chongwen [1 ]
Gao, Liming [1 ]
机构
[1] Beijing Inst Technol, Digital Media Res Inst, Beijing, Peoples R China
关键词
Natural language processing; Information retrieval; Similarity Calculation;
D O I
10.1145/3302425.3302437
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the current information retrieval are based on keyword information appearing in the text or statistical information according to the number of vocabulary words. It is also possible to add additional semantic information by using synonyms, polysemous words, etc. to increase the accuracy of similarity and screening. However, in the current network, in addition to generate a large number of new words every day, pictures, audio, video and other information will appear too. Therefore, the manual features are difficult to express on this kind of newly appearing data, and the low-dimensional feature abstraction is very difficult to represent the overall semantics of text and images. In this paper, we propose a semantic feature extraction algorithm based on deep network, which applies the local attention mechanism to the feature generation model of pictures and texts. The retrieval of text and image information is converted into the similarity calculation of the vector, which improves the retrieval speed and ensures the semantic relevance of the result. Through the compilation of many years of news text and image data to complete the training and testing of text and image feature extraction models, the results show that the depth feature model has great advantages in semantic expression and feature extraction. On the other hand, add the similarity calculation to the training processing also improve the retrieval accuracy.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Entity Semantic Feature Fusion Network for Remote Sensing Image-Text Retrieval
    Shui, Jianan
    Ding, Shuaipeng
    Li, Mingyong
    Ma, Yan
    WEB AND BIG DATA, APWEB-WAIM 2024, PT V, 2024, 14965 : 130 - 145
  • [32] MUSH: Multi-scale Hierarchical Feature Extraction for Semantic Image Synthesis
    Wang, Zicong
    Ren, Qiang
    Wang, Junli
    Yan, Chungang
    Jiang, Changjun
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 185 - 201
  • [33] Image Semantic Segmentation Scheme based on XGBoost combination with Convolution Feature Extraction
    Dai, Zichen
    Liu, Xuewen
    Xu, Chi
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6334 - 6340
  • [34] A tag based joint extraction model for Chinese medical text
    Liu, XingYu
    Liu, Yu
    Wu, HangYu
    Guan, QingQuan
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 93
  • [35] Feature extraction for document image segmentation by pLSA model
    Yamaguchi, Takuma
    Maruyama, Minoru
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 53 - 60
  • [36] A New Feature Extraction Model Based on Image Gradient
    Zhao, Minghua
    Yuan, Yongqin
    Li, Bing
    Shi, Zhenghao
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 390 - 394
  • [37] An investigation on feature and text extraction from images using image recognition in Android
    Panchal, Brijeshkumar Y.
    Chauhan, Gaurang
    Panchal, Sandipkumar R.
    Chaudhari, Urvashi M.
    MATERIALS TODAY-PROCEEDINGS, 2022, 51 : 798 - 802
  • [38] Text String Extraction from Scene Image Based on Edge Feature and Morphology
    Wang, Yuming
    Tanaka, Naoki
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 323 - 328
  • [39] Remote sensing image-text retrieval based on layout semantic joint representation
    Zhang R.
    Nie J.
    Song N.
    Zheng C.
    Wei Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (02): : 671 - 683
  • [40] Joint Transform Correlator Based on Joint Image Feature Extraction Using Swarm Intelligence Method
    Wang, Yong
    Zhu, Ming
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 4964 - 4969