News video retrieval by learning multimodal semantic information

被引：0

作者：

Yu, Hui ^{[1
]}

Su, Bolan ^{[1
]}

Lu, Hong ^{[1
]}

Xue, Xiangyang ^{[1
]}

机构：

[1] Fudan Univ, Dept Comp Sci & Engn, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China

来源：

ADVANCES IN VISUAL INFORMATION SYSTEMS | 2007年 / 4781卷

关键词：

video retrieval; rich semantic information; TRECVID; manual search task;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the explosion of multimedia data especially that of video data, requirement of efficient video retrieval has becoming more and more important. Years of TREC Video Retrieval Evaluation (TRECVID) research gives benchmark for video search task. The video data in TRECVID are mainly news video. In this paper a compound model consisting of several atom search modules, i.e., textual and visual, for news video retrieval is introduced. First, the analysis on query topics helps to improve the performance of video retrieval. Furthermore, the multimodal fusion of all atom search modules ensures to get good performance. Experimental results on TRECVID 2005 and TRECVID 2006 search tasks demonstrate the effectiveness of the proposed method.

引用

页码：403 / 414

页数：12

共 50 条

[41] Semantic Description and Information Retrieval Research of Surveillance Video in Smart Transportation System
Yang, Boxiong
Huang, Jing
Yang, Yuqi
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTROMECHANICAL CONTROL TECHNOLOGY AND TRANSPORTATION, 2015, 41 : 238 - 244
[42] Multimodal search for effective video retrieval
Natsev, Apostol
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 525 - 528
[43] Role of Semantic Links in Performance of Information Retrieval on Graph-based Multimodal Collections
Sabetghadam, Serwah
Lupu, Mihai
Rauber, Andreas
2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1574 - 1579
[44] A model for multimodal information retrieval
Srihari, RK
Rao, AB
Han, B
Munirathnam, S
Wu, XY
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 701 - 704
[45] Semantic indexing for instructional video via combination of handwriting recognition and information retrieval
Tang, LJ
Kender, JR
2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 921 - 924
[46] LOOK, TELL AND MATCH: REFINING VIDEO-TEXT RETRIEVAL WITH SEMANTIC INFORMATION
Zhu Jinkuan
Hu Weiyi
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
[47] Towards Semantic Multimodal Video Annotation
Grassi, Marco
Morbidoni, Christian
Piazza, Francesco
TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 305 - 316
[48] VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles
Li, Mingzhe
Chen, Xiuying
Gao, Shen
Chan, Zhangming
Zhao, Dongyan
Yan, Rui
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 9360 - 9369
[49] Regim VID A Semantic and Personalized Framework for News Video Retrieval Based on Textual and Visual Transcripts
Karray, Hichem
Ben Ammar, Anis
Alimi, Adel M.
JOURNAL OF DECISION SYSTEMS, 2011, 20 (04) : 467 - 490
[50] Detection and retrieval of captions in news video
Luo, M
Bai, XS
Xu, GG
VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 233 - 238

← 1 2 3 4 5 →