GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval

被引:0
|
作者
Wang, Yuting [1 ,3 ]
Wang, Jinpeng [1 ,3 ]
Chen, Bin [2 ,3 ]
Zeng, Ziyun [1 ,3 ]
Xia, Shu-Tao [1 ,3 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Beijing, Peoples R China
[2] Harbin Inst Technol, Shenzhen, Peoples R China
[3] Peng Cheng Lab, Res Ctr Artificial Intelligence, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a text query, partially relevant video retrieval (PRVR) seeks to find untrimmed videos containing pertinent moments in a database. For PRVR, clip modeling is essential to capture the partial relationship between texts and videos. Current PRVR methods adopt scanning-based clip construction to achieve explicit clip modeling, which is information-redundant and requires a large storage overhead. To solve the efficiency problem of PRVR methods, this paper proposes GMMFormer, a Gaussian-Mixture-Model based Transformer which models clip representations implicitly. During frame interactions, we incorporate Gaussian-Mixture-Model constraints to focus each frame on its adjacent frames instead of the whole video. Then generated representations will contain multi-scale clip information, achieving implicit clip modeling. In addition, PRVR methods ignore semantic differences between text queries relevant to the same video, leading to a sparse embedding space. We propose a query diverse loss to distinguish these text queries, making the embedding space more intensive and contain more semantic information. Extensive experiments on three large-scale video datasets (i.e., TVR, ActivityNet Captions, and Charades-STA) demonstrate the superiority and efficiency of GMMFormer. Code is available at https://github.com/huangmozhi9527/GMMFormer.
引用
收藏
页码:5767 / 5775
页数:9
相关论文
共 50 条
  • [41] Video fire detection based on Gaussian Mixture Model and multi-color features
    Xian-Feng Han
    Jesse S. Jin
    Ming-Jie Wang
    Wei Jiang
    Lei Gao
    Li-Ping Xiao
    Signal, Image and Video Processing, 2017, 11 : 1419 - 1425
  • [42] Video fire detection based on Gaussian Mixture Model and multi-color features
    Han, Xian-Feng
    Jin, Jesse S.
    Wang, Ming-Jie
    Jiang, Wei
    Gao, Lei
    Xiao, Li-Ping
    SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (08) : 1419 - 1425
  • [43] A Method of Surveillance Video Structured Based on Gaussian Mixture Model and Support Vector Machine
    Wu, Jinyong
    Zhao, Yong
    Yuan, Yule
    Zhang, Xing
    Wang, Yike
    2012 THIRD GLOBAL CONGRESS ON INTELLIGENT SYSTEMS (GCIS 2012), 2012, : 282 - 286
  • [44] Image Retrieval Based on Gaussian Mixture Approach to Color Localization
    Luszczkiewicz-Piatek, Maria
    Smolka, Bogdan
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2011, PT I, 2011, 6978 : 347 - 355
  • [45] Efficient Volume Exploration Using the Gaussian Mixture Model
    Wang, Yunhai
    Chen, Wei
    Zhang, Jian
    Dong, Tingxing
    Shan, Guihua
    Chi, Xuebin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (11) : 1560 - 1573
  • [46] Content-based image retrieval by using Deep Kernel Machine with Gaussian Mixture Model
    Kiasari, Mohammad Ahangar
    Lee, Minho
    Shen, Jixiang
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3371 - 3377
  • [47] Efficient Gaussian Mixture Model Evaluation in Voice Conversion
    Tian, Jilei
    Nurminen, Jani
    Popa, Victor
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2282 - 2285
  • [48] Texture image retrieval based on a Gaussian Mixture Model and similarity measure using a Kullback divergence
    Yuan, H
    Zhang, XP
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1867 - 1870
  • [49] A Probabilistic Quality-Relevant Monitoring Method With Gaussian Mixture Model
    Yu, Wanke
    Zhao, Chunhui
    Huang, Biao
    Yang, Hui
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 12
  • [50] Reduced-Degrees-of-Freedom Gaussian-Mixture-Model Fitting for Large-Scale History-Matching Problems
    Gao, Guohua
    Jiang, Hao
    Chen, Chaohui
    Vink, Jeroen C.
    El Khamra, Yaakoub
    Ita, Joel
    Saaf, Fredrik
    SPE JOURNAL, 2020, 25 (01): : 37 - 55