News video retrieval by learning multimodal semantic information

被引:0
|
作者
Yu, Hui [1 ]
Su, Bolan [1 ]
Lu, Hong [1 ]
Xue, Xiangyang [1 ]
机构
[1] Fudan Univ, Dept Comp Sci & Engn, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
来源
关键词
video retrieval; rich semantic information; TRECVID; manual search task;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the explosion of multimedia data especially that of video data, requirement of efficient video retrieval has becoming more and more important. Years of TREC Video Retrieval Evaluation (TRECVID) research gives benchmark for video search task. The video data in TRECVID are mainly news video. In this paper a compound model consisting of several atom search modules, i.e., textual and visual, for news video retrieval is introduced. First, the analysis on query topics helps to improve the performance of video retrieval. Furthermore, the multimodal fusion of all atom search modules ensures to get good performance. Experimental results on TRECVID 2005 and TRECVID 2006 search tasks demonstrate the effectiveness of the proposed method.
引用
收藏
页码:403 / 414
页数:12
相关论文
共 50 条
  • [21] Semantic Information Retrieval for Personalized E-learning
    Zhuhadar, Leyla
    Nasraoui, Olfa
    20TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL 1, PROCEEDINGS, 2008, : 364 - 368
  • [22] Multimodal Video Retrieval and Multimodal Language Modelling
    Wang, Hui
    Kittler, Josef
    Gales, Mark
    Cooper, Rob
    Mulvenna, Maurice
    Ng, Wing
    Hua, Yang
    Gault, Richard
    Haider, Abbas
    Wu, Guanfeng
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1345 - 1355
  • [23] Speech retrieval for TV news programs by fusing the audio and video information
    Gao, Xinbo
    Li, Jie
    Ji, Hongbing
    International Conference on Signal Processing Proceedings, ICSP, 2002, 2 : 994 - 997
  • [24] Face retrieval in broadcasting news video by fusing temporal and intensity information
    Le, Duy-Dinh
    Satoh, Shin'ichi
    Houle, Michael E.
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 391 - 400
  • [25] Speech retrieval for TV news programs by fusing the audio and video information
    Gao, XB
    Jie, L
    Ji, HB
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 994 - 997
  • [26] Enhancing latent semantic analysis video object retrieval with structural information
    Hohl, L
    Souvannavong, F
    Merialdo, B
    Huet, B
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 1609 - 1612
  • [27] The myth of semantic video retrieval
    Dimitrova, N
    ACM COMPUTING SURVEYS, 1995, 27 (04) : 584 - 586
  • [28] Semantic information retrieval
    Pejtersen, AM
    COMMUNICATIONS OF THE ACM, 1998, 41 (04) : 90 - 92
  • [29] Semantic Information Retrieval
    Risø National Labotatory, Roskilde, Denmark
    Commun ACM, 4 (XXXX-XXXXI):
  • [30] On Semantic Similarity in Video Retrieval
    Wray, Michael
    Doughty, Hazel
    Damen, Dima
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3649 - 3659