A Joint Model for Text and Image Semantic Feature Extraction

被引:8
|
作者
Cao, Jiarun [1 ]
Wang, Chongwen [1 ]
Gao, Liming [1 ]
机构
[1] Beijing Inst Technol, Digital Media Res Inst, Beijing, Peoples R China
关键词
Natural language processing; Information retrieval; Similarity Calculation;
D O I
10.1145/3302425.3302437
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the current information retrieval are based on keyword information appearing in the text or statistical information according to the number of vocabulary words. It is also possible to add additional semantic information by using synonyms, polysemous words, etc. to increase the accuracy of similarity and screening. However, in the current network, in addition to generate a large number of new words every day, pictures, audio, video and other information will appear too. Therefore, the manual features are difficult to express on this kind of newly appearing data, and the low-dimensional feature abstraction is very difficult to represent the overall semantics of text and images. In this paper, we propose a semantic feature extraction algorithm based on deep network, which applies the local attention mechanism to the feature generation model of pictures and texts. The retrieval of text and image information is converted into the similarity calculation of the vector, which improves the retrieval speed and ensures the semantic relevance of the result. Through the compilation of many years of news text and image data to complete the training and testing of text and image feature extraction models, the results show that the depth feature model has great advantages in semantic expression and feature extraction. On the other hand, add the similarity calculation to the training processing also improve the retrieval accuracy.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Maximum Entropy Model based on Feature Extraction for Sentiment Detection of Text
    Li, Jun
    Jin, Wei
    Zhang, Zihao
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1300 - 1307
  • [42] A new text feature extraction model and itsapplication in document copy detection
    Bao, JP
    Shen, JY
    Liu, XD
    Song, QB
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 82 - 87
  • [43] A network-based feature extraction model for imbalanced text data
    Li, Keping
    Yan, Dongyang
    Liu, Yanyan
    Zhu, Qiaozhen
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [44] Feature Differentiation and Fusion for Semantic Text Matching
    Peng, Rui
    Hong, Yu
    Jin, Zhiling
    Yao, Jianmin
    Zhou, Guodong
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 32 - 46
  • [45] JOINT LEARNING OF IMAGE AESTHETIC QUALITY ASSESSMENT AND SEMANTIC RECOGNITION BASED ON FEATURE ENHANCEMENT
    Liu, Xiangfei
    Nie, Xiushan
    Shen, Zhen
    Yin, Yilong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2075 - 2079
  • [46] JOINT MULTI-FEATURE HYPERSPECTRAL IMAGE CLASSIFICATION WITH SPATIAL CONSTRAINT IN SEMANTIC MANIFOLD
    Zhang, Xiangrong
    Gao, Zeyu
    An, Jinliang
    Hu, Yanning
    Li, Yangyang
    Hou, Biao
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 481 - 484
  • [47] Feature Extraction Using Semantic Similarity
    Aboelela, Eman M.
    Gad, Walaa
    Ismail, Rasha
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 82 - 91
  • [48] A SEMANTIC APPROACH TO THE EXTRACTION OF FEATURE TERMS
    Angioni, Manuela
    Tuveri, Franco
    ICSOFT 2011: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATABASE TECHNOLOGIES, VOL 2, 2011, : 402 - 407
  • [49] A Review on Feature Selection and Feature Extraction for Text Classification
    Shah, Foram P.
    Patel, Vibha
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2264 - 2268
  • [50] Semantic retrieval in a large-scale video database by using both image and text feature
    Yu, Chuan
    Mo, Hiroshi
    Katayama, Norio
    Satoh, Shin'ichi
    Asano, Shoichiro
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3332 : 770 - 777