DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

被引:0
|
作者
Gu, Xiuye [1 ]
Cui, Yin [1 ,2 ]
Huang, Jonathan [1 ]
Rashwan, Abdullah [1 ]
Yang, Xuan [1 ]
Zhou, Xingyi [1 ]
Ghiasi, Golnaz [1 ]
Kuo, Weicheng [1 ]
Chen, Huizhong [1 ]
Chen, Liang-Chieh [1 ,3 ]
Ross, David [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] NVIDIA, Santa Clara, CA USA
[3] ByteDance, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Observing the close relationship among panoptic, semantic and instance segmentation tasks, we propose to train a universal multi-dataset multi-task segmentation model: DaTaSeg. We use a shared representation (mask proposals with class predictions) for all tasks. To tackle task discrepancy, we adopt different merge operations and post-processing for different tasks. We also leverage weak-supervision, allowing our segmentation model to benefit from cheaper bounding box annotations. To share knowledge across datasets, we use text embeddings from the same semantic embedding space as classifiers and share all network parameters among datasets. We train DaTaSeg on ADE semantic, COCO panoptic, and Objects365 detection datasets. DaTaSeg improves performance on all datasets, especially small-scale datasets, achieving 54.0 mIoU on ADE semantic and 53.5 PQ on COCO panoptic. DaTaSeg also enables weakly-supervised knowledge transfer on ADE panoptic and Objects365 instance segmentation. Experiments show DaTaSeg scales with the number of training datasets and enables open-vocabulary segmentation through direct transfer. In addition, we annotate an Objects365 instance segmentation set of 1,000 images and release it as a public evaluation benchmark on https://laoreja.github.io/dataseg.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Multi-task Learning for Brain Tumor Segmentation
    Weninger, Leon
    Liu, Qianyu
    Merhof, Dorit
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT I, 2020, 11992 : 327 - 337
  • [22] Joint multi-task cascade for instance segmentation
    Yaole Wen
    Fuyuan Hu
    Jinchang Ren
    Xinru Shang
    Linyan Li
    Xuefeng Xi
    Journal of Real-Time Image Processing, 2020, 17 : 1983 - 1989
  • [23] Multi-task learning framework for echocardiography segmentation
    Monkam, Patrice
    Jin, Songbai
    Lu, Wenkai
    2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
  • [24] Multi-task Learning Based Skin Segmentation
    Tan, Taizhe
    Shan, Zhenghao
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2023, 2023, 14119 : 360 - 369
  • [25] Joint multi-task cascade for instance segmentation
    Wen, Yaole
    Hu, Fuyuan
    Ren, Jinchang
    Shang, Xinru
    Li, Linyan
    Xi, Xuefeng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 1983 - 1989
  • [26] Multi-GlaucNet: A multi-task model for optic disc segmentation, blood vessel segmentation and glaucoma detection
    Xiong, Haoren
    Long, Fei
    Alam, Mohammad S.
    Sang, Jun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [27] A loss-balanced multi-task model for simultaneous detection and segmentation
    Zhang, Wenwen
    Wang, Kunfeng
    Wang, Yutong
    Yan, Lan
    Wang, Fei-Yue
    NEUROCOMPUTING, 2021, 428 : 65 - 78
  • [28] MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control
    Wagener, Nolan
    Kolobov, Andrey
    Frujeri, Felipe Vieira
    Loynd, Ricky
    Cheng, Ching-An
    Hausknecht, Matthew
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] Source CSI Dataset for Multi-Task CSI Feedback
    Inoue, Mayuko
    Ohtsuki, Tomoaki
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1042 - 1043
  • [30] A multi-scale, multi-task fusion UNet model for accurate breast tumor segmentation
    Dai, Shuo
    Liu, Xueyan
    Wei, Wei
    Yin, Xiaoping
    Qiao, Lishan
    Wang, Jianing
    Zhang, Yu
    Hou, Yan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 258