News video retrieval by learning multimodal semantic information

Cited by: 0
Authors
Yu, Hui [1]
Su, Bolan [1]
Lu, Hong [1]
Xue, Xiangyang [1]
Affiliations
[1] Fudan Univ, Dept Comp Sci & Engn, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
Source
Keywords
video retrieval; rich semantic information; TRECVID; manual search task;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the explosion of multimedia data, especially video data, efficient video retrieval has become increasingly important. Years of TREC Video Retrieval Evaluation (TRECVID) research provide a benchmark for the video search task. The video data in TRECVID are mainly news video. This paper introduces a compound model for news video retrieval that consists of several atom search modules, i.e., textual and visual. First, analysis of the query topics helps to improve retrieval performance. Furthermore, multimodal fusion of all atom search modules ensures good performance. Experimental results on the TRECVID 2005 and TRECVID 2006 search tasks demonstrate the effectiveness of the proposed method.
Pages: 403 - 414
Page count: 12
Related papers
50 in total
  • [41] Semantic Description and Information Retrieval Research of Surveillance Video in Smart Transportation System
    Yang, Boxiong
    Huang, Jing
    Yang, Yuqi
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTROMECHANICAL CONTROL TECHNOLOGY AND TRANSPORTATION, 2015, 41 : 238 - 244
  • [42] Multimodal search for effective video retrieval
    Natsev, Apostol
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 525 - 528
  • [43] Role of Semantic Links in Performance of Information Retrieval on Graph-based Multimodal Collections
    Sabetghadam, Serwah
    Lupu, Mihai
    Rauber, Andreas
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1574 - 1579
  • [44] A model for multimodal information retrieval
    Srihari, RK
    Rao, AB
    Han, B
    Munirathnam, S
    Wu, XY
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 701 - 704
  • [45] Semantic indexing for instructional video via combination of handwriting recognition and information retrieval
    Tang, LJ
    Kender, JR
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 921 - 924
  • [46] LOOK, TELL AND MATCH: REFINING VIDEO-TEXT RETRIEVAL WITH SEMANTIC INFORMATION
    Zhu Jinkuan
    Hu Weiyi
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [47] Towards Semantic Multimodal Video Annotation
    Grassi, Marco
    Morbidoni, Christian
    Piazza, Francesco
    TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 305 - 316
  • [48] VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles
    Li, Mingzhe
    Chen, Xiuying
    Gao, Shen
    Chan, Zhangming
    Zhao, Dongyan
    Yan, Rui
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 9360 - 9369
  • [49] Regim VID A Semantic and Personalized Framework for News Video Retrieval Based on Textual and Visual Transcripts
    Karray, Hichem
    Ben Ammar, Anis
    Alimi, Adel M.
    JOURNAL OF DECISION SYSTEMS, 2011, 20 (04) : 467 - 490
  • [50] Detection and retrieval of captions in news video
    Luo, M
    Bai, XS
    Xu, GG
    VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 233 - 238