General and Task-Oriented Video Segmentation

被引:0
|
作者
Chen, Mu [1 ]
Li, Liulei [1 ]
Wang, Wenguan [2 ]
Quan, Ruijie [2 ]
Yang, Yi [2 ]
机构
[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia
[2] Zhejiang Univ, ReLER Lab, CCAI, Hangzhou, Peoples R China
来源
关键词
Video segmentation; General solution; Task-orientation; INSTANCE; TRANSFORMER; ATTENTION; SHAPE;
D O I
10.1007/978-3-031-72667-5_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present GVSEG, a general video segmentation framework for addressing four different video segmentation tasks (i.e., instance, semantic, panoptic, and exemplar-guided) while maintaining an identical architectural design. Currently, there is a trend towards developing general video segmentation solutions that can be applied across multiple tasks. This streamlines research endeavors and simplifies deployment. However, such a highly homogenized framework in current design, where each element maintains uniformity, could overlook the inherent diversity among different tasks and lead to suboptimal performance. To tackle this, GVSEG: i) provides a holistic disentanglement and modeling for segment targets, thoroughly examining them from the perspective of appearance, position, and shape, and on this basis, ii) reformulates the query initialization, matching and sampling strategies in alignment with the task-specific requirement. These architecture-agnostic innovations empower GVSEG to effectively address each unique task by accommodating the specific properties that characterize them. Extensive experiments on seven gold-standard benchmark datasets demonstrate that GVSEG surpasses all existing specialized/general solutions by a significant margin on four different video segmentation tasks.
引用
收藏
页码:72 / 92
页数:21
相关论文
共 50 条
  • [41] Mutation Testing for Task-Oriented Chatbots
    Gomez-Abajo, Pablo
    Perez-Soler, Sara
    Canizares, Pablo C.
    Guerra, Esther
    de lara, Juan
    PROCEEDINGS OF 2024 28TH INTERNATION CONFERENCE ON EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, EASE 2024, 2024, : 232 - 241
  • [42] Knowledge discovery in task-oriented dialogue
    Puppi Wanderley, Gregory Moro
    Tacla, Cesar Augusto
    Barthes, Jean-Paul A.
    Paraiso, Emerson Cabrera
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (20) : 6807 - 6818
  • [43] Software architectures for task-oriented computing
    Garlan, David
    Software Architecture, Proceedings, 2007, 4758 : 1 - 1
  • [44] Task-oriented maximally entangled states
    Agrawal, Pankaj
    Pradhan, B.
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2010, 43 (23)
  • [45] Strategic Decisions in Task-Oriented Reading
    Salmeron, Ladislao
    Vidal-Abarca, Eduardo
    Martinez, Tomas
    Mana, Amelia
    Gil, Laura
    Naumann, Johannes
    SPANISH JOURNAL OF PSYCHOLOGY, 2015, 18
  • [46] TOA: Task-oriented Active VQA
    Xing, Xiaoying
    Liang, Mingfu
    Wu, Ying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [47] Task-Oriented Communication Design at Scale
    Mostaani, Arsham
    Vu, Thang X.
    Habibi, Hamed
    Chatzinotas, Symeon
    Ottersten, Bjorn
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2025, 73 (01) : 378 - 393
  • [48] Task-Oriented Explainable Semantic Communications
    Ma, Shuai
    Qiao, Weining
    Wu, Youlong
    Li, Hang
    Shi, Guangming
    Gao, Dahua
    Shi, Yuanming
    Li, Shiyin
    Al-Dhahir, Naofal
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9248 - 9262
  • [49] Task-Oriented Rehabilitation Program for Stroke
    Peters, Heather Tanksley
    Page, Stephen J.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2016, 316 (01): : 101 - 102
  • [50] ESTIMATION OF TASK-ORIENTED TIME IN NEUROTICS
    CHATTERJEA, RG
    CHATTERJEE, PK
    BISWAS, PK
    BASU, AK
    AUSTRALIAN JOURNAL OF PSYCHOLOGY, 1980, 32 (01) : 59 - 63