A survey on multimodal video representation for semantic retrieval

被引:0
|
作者
Calic, J [1 ]
Campbell, N [1 ]
Dasiopoulou, S [1 ]
Kompatsiaris, Y [1 ]
机构
[1] Univ Bristol, Dept Comp Sci, Bristol BS8 1UB, Avon, England
关键词
video representation; multimodality; content-based indexing and retrieval; semantic gap;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper surveys the approaches to video representation, focusing on semantic analysis for content-based indexing and retrieval. A problem of adaptive representation of digital multimedia is critically assessed and some novel ideas are presented. Furthermore, the concept of video multimodality is reevaluated and redefined in order to introduce modalities such as editing technique or affect to the audience.
引用
收藏
页码:135 / 138
页数:4
相关论文
共 50 条
  • [1] News video retrieval by learning multimodal semantic information
    Yu, Hui
    Su, Bolan
    Lu, Hong
    Xue, Xiangyang
    ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 403 - 414
  • [2] Reducing Semantic Gap in Video Retrieval with Fusion: A survey
    Sudha, D.
    Priyadarshini, J.
    BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 496 - 502
  • [3] Group sparse representation for image categorization and semantic video retrieval
    Liu YaNan
    Wu Fei
    Zhuang YueTing
    SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (10) : 2051 - 2063
  • [4] Group sparse representation for image categorization and semantic video retrieval
    LIU YaNan 1
    2 College of Computer Science and Technology
    Science China(Information Sciences), 2011, 54 (10) : 2051 - 2063
  • [5] Group sparse representation for image categorization and semantic video retrieval
    YaNan Liu
    Fei Wu
    YueTing Zhuang
    Science China Information Sciences, 2011, 54 : 2051 - 2063
  • [6] Semantic retrieval of video
    Xiong, ZY
    Zhou, XS
    Tian, Q
    Rui, Y
    Huang, TS
    IEEE SIGNAL PROCESSING MAGAZINE, 2006, 23 (02) : 18 - 27
  • [7] Multimodal feature extraction and fusion for semantic mining of soccer video: a survey
    Payam Oskouie
    Sara Alipour
    Amir-Masoud Eftekhari-Moghadam
    Artificial Intelligence Review, 2014, 42 : 173 - 210
  • [8] Multimodal feature extraction and fusion for semantic mining of soccer video: a survey
    Oskouie, Payam
    Alipour, Sara
    Eftekhari-Moghadam, Amir-Masoud
    ARTIFICIAL INTELLIGENCE REVIEW, 2014, 42 (02) : 173 - 210
  • [9] Multimodal semantic enhanced representation network for micro-video event detection
    Li, Yun
    Liu, Xianyi
    Zhang, Lijuan
    Tian, Haoyu
    Jing, Peiguang
    KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [10] Multimodal Video Retrieval and Multimodal Language Modelling
    Wang, Hui
    Kittler, Josef
    Gales, Mark
    Cooper, Rob
    Mulvenna, Maurice
    Ng, Wing
    Hua, Yang
    Gault, Richard
    Haider, Abbas
    Wu, Guanfeng
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1345 - 1355