General and Task-Oriented Video Segmentation

Authors
Chen, Mu [1]
Li, Liulei [1]
Wang, Wenguan [2]
Quan, Ruijie [2]
Yang, Yi [2]
Affiliations
[1] Univ Technol Sydney, ReLER Lab, AAII, Ultimo, Australia
[2] Zhejiang Univ, ReLER Lab, CCAI, Hangzhou, Peoples R China
Keywords
Video segmentation; General solution; Task-orientation; INSTANCE; TRANSFORMER; ATTENTION; SHAPE;
DOI
10.1007/978-3-031-72667-5_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present GVSEG, a general video segmentation framework for addressing four different video segmentation tasks (i.e., instance, semantic, panoptic, and exemplar-guided) while maintaining an identical architectural design. Currently, there is a trend towards developing general video segmentation solutions that can be applied across multiple tasks. This streamlines research endeavors and simplifies deployment. However, such a highly homogenized framework in current design, where each element maintains uniformity, could overlook the inherent diversity among different tasks and lead to suboptimal performance. To tackle this, GVSEG: i) provides a holistic disentanglement and modeling for segment targets, thoroughly examining them from the perspective of appearance, position, and shape, and on this basis, ii) reformulates the query initialization, matching and sampling strategies in alignment with the task-specific requirement. These architecture-agnostic innovations empower GVSEG to effectively address each unique task by accommodating the specific properties that characterize them. Extensive experiments on seven gold-standard benchmark datasets demonstrate that GVSEG surpasses all existing specialized/general solutions by a significant margin on four different video segmentation tasks.
Pages: 72-92
Page count: 21