Query-Oriented Micro-Video Summarization

Cited by: 0
Authors
Jia, Mengzhao [1]
Wei, Yinwei [2]
Song, Xuemeng [1]
Sun, Teng [1]
Zhang, Min [3]
Nie, Liqiang [3]
Affiliations
[1] Shandong Univ, Dept Comp Sci & Technol, Qingdao 250100, Peoples R China
[2] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[3] Harbin Inst Technol, Sch Comp, Shenzhen 150001, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video summarization; query suggestion; micro-video retrieval
DOI
10.1109/TPAMI.2024.3355402
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The query-oriented micro-video summarization task aims to generate a concise sentence with two properties: (a) it summarizes the main semantics of the micro-video, and (b) it is expressed in the form of a search query to facilitate retrieval. Despite its considerable application value in retrieval, this direction has barely been explored. Previous summarization studies mostly focus on content summarization for traditional long videos. Directly applying them tends to yield unsatisfactory results because of the unique features of micro-videos and queries: diverse entities and complex scenes within a short time span, semantic gaps between modalities, and varied queries with distinct expressions. To adapt to these characteristics, we propose a query-oriented micro-video summarization model, dubbed QMS. It uses an encoder-decoder transformer architecture as its backbone. The multi-modal (visual and textual) signals are passed through two modality-specific encoders to obtain their representations, followed by an entity-aware representation learning module that identifies and highlights critical entity information. For optimization, to account for the large semantic gaps between modalities, we assign them different confidence scores according to their semantic relevance. Additionally, we develop a novel strategy to sample an effective target query from the diverse query set with various expressions. Extensive experiments demonstrate the superiority of QMS over several state-of-the-art methods on both the summarization and retrieval tasks.
Pages: 4174-4187
Page count: 14