General and Task-Oriented Video Segmentation

Cited by: 0
Authors
Chen, Mu [1]
Li, Liulei [1]
Wang, Wenguan [2]
Quan, Ruijie [2]
Yang, Yi [2]
Affiliations
[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia
[2] Zhejiang Univ, ReLER Lab, CCAI, Hangzhou, Peoples R China
Keywords
Video segmentation; General solution; Task-orientation; Instance; Transformer; Attention; Shape
DOI
10.1007/978-3-031-72667-5_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present GVSEG, a general video segmentation framework for addressing four different video segmentation tasks (i.e., instance, semantic, panoptic, and exemplar-guided) while maintaining an identical architectural design. Currently, there is a trend towards developing general video segmentation solutions that can be applied across multiple tasks. This streamlines research endeavors and simplifies deployment. However, the highly homogenized design of current frameworks, in which every element is kept uniform across tasks, could overlook the inherent diversity among different tasks and lead to suboptimal performance. To tackle this, GVSEG: i) provides a holistic disentanglement and modeling of segment targets, thoroughly examining them from the perspective of appearance, position, and shape, and on this basis, ii) reformulates the query initialization, matching, and sampling strategies in alignment with task-specific requirements. These architecture-agnostic innovations empower GVSEG to effectively address each unique task by accommodating the specific properties that characterize it. Extensive experiments on seven gold-standard benchmark datasets demonstrate that GVSEG surpasses all existing specialized/general solutions by a significant margin on four different video segmentation tasks.
Pages: 72-92
Page count: 21