MT-VQA: A Multi-task Approach for Quality Assessment of Short-form Videos

被引:0
|
作者
Wen, Shijie [1 ]
Qiao, Minglang [1 ]
Jiang, Lai [1 ]
Xu, Mai [1 ]
Deng, Xin [1 ]
Li, Shengxi [1 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
基金
北京市自然科学基金;
关键词
Short-form video; video quality assessment; human attention; SALIENCY; PREDICTION;
D O I
10.1145/3689093.3689181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short-form video, as a mainstream media form on video platforms, has undergone explosive growth in recent years. A vast number of short-form videos are produced, processed, and distributed to users each day, inevitably leading to quality degradation. Therefore, accurate video quality assessment (VQA) is critical for monitoring and optimizing the viewing experience of users. However, the existing short-form VQA approaches neglect human attention patterns during the viewing of videos. Besides, the advancement of short-form VQA is obstructed by the absence of large-scale datasets. To tackle the above challenges, we first construct a large-scale short-form VQA dataset called SVQA. The SVQA dataset comprises diverse distortion types, covering the typical quality degradations that arise during the photography, encoding, and editing of short-form videos. Besides, for each short-form video in SVQA, we collect both quality score and eye-tracking annotation. Based on our dataset, we propose a two-branch multi-task VQA approach, MT-VQA, in which both tasks of VQA and video saliency prediction (VSP) can be accomplished for short-form videos. We further propose a saliency fusion module to guide the VQA branch to focus on quality distortions within visually attractive regions. Extensive experiments show that our multi-task approach achieves superior performance in both VQA and VSP tasks.
引用
收藏
页码:30 / 38
页数:9
相关论文
共 50 条
  • [31] NON-INTRUSIVE SPEECH QUALITY ASSESSMENT WITH MULTI-TASK LEARNING BASED ON TENSOR NETWORK
    Liu, Hanyue
    Liu, Miao
    Wang, Jing
    Xie, Xiang
    Yang, Lidong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 851 - 855
  • [32] Chest CT-IQA: A multi-task model for chest CT image quality assessment and classification
    Xun, Siyi
    Jiang, Mingfeng
    Huang, Pu
    Sun, Yue
    Li, Dengwang
    Luo, Yan
    Zhang, Huifen
    Zhang, Zhicheng
    Liu, Xiaohong
    Wu, Mingxiang
    Tan, Tao
    DISPLAYS, 2024, 84
  • [33] CONTENT-DEPENDENCY REDUCTION WITH MULTI-TASK LEARNING IN BLIND STITCHED PANORAMIC IMAGE QUALITY ASSESSMENT
    Hou, Jingwen
    Lin, Weisi
    Zhao, Baoquan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3463 - 3467
  • [34] Dual-Channel Multi-Task CNN for No-Reference Screen Content Image Quality Assessment
    Zhang, Chaofan
    Huang, Ziqing
    Liu, Shiguang
    Xiao, Jian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5011 - 5025
  • [35] A MULTI-TASK CONVOLUTIONAL NEURAL NETWORK FOR BLIND STEREOSCOPIC IMAGE QUALITY ASSESSMENT USING NATURALNESS ANALYSIS
    Bourbia, Salima
    Karine, Ayoub
    Chetouani, Aladine
    El Hassoun, Mohammed
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1434 - 1438
  • [36] No-reference stereoscopic image quality assessment using a multi-task CNN and registered distortion representation
    Shi, Yiqing
    Guo, Wenzhong
    Niu, Yuzhen
    Zhan, Jiamei
    PATTERN RECOGNITION, 2020, 100
  • [37] RATING-AUGMENTED NO-REFERENCE POINT CLOUD QUALITY ASSESSMENT USING MULTI-TASK LEARNING
    Wang, Xinyu
    Wang, Xiaochuan
    Liu, Ruijun
    Huang, Xiankai
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4320 - 4324
  • [38] A Multi-Task Approach for Real-Time Quality of Experience Factors Prediction from Physiological Data
    Begue, Joshua
    Labiod, Mohamed Aymen
    Mellouk, Abdelhamid
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 5853 - 5858
  • [39] A Semi-Supervised Multi-Task Learning Approach for Predicting Short-Term Kidney Disease Evolution
    Bernardini, Michele
    Romeo, Luca
    Frontoni, Emanuele
    Amini, Massih-Reza
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (10) : 3983 - 3994
  • [40] Deep learning-driven multi-view multi-task image quality assessment method for chest CT image
    Jialin Su
    Meifang Li
    Yongping Lin
    Liu Xiong
    Caixing Yuan
    Zhimin Zhou
    Kunlong Yan
    BioMedical Engineering OnLine, 22