MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation

被引:5
|
作者
Lambert, John [1 ]
Liu, Zhuang [2 ]
Sener, Ozan [3 ]
Hays, James [1 ]
Koltun, Vladlen [3 ]
机构
[1] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA
[2] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA
[3] Intel Labs, Santa Clara, CA 95054 USA
关键词
Training; Semantics; Computational modeling; Annotations; Taxonomy; Image segmentation; Benchmark testing; Robust vision; semantic segmentation; instance segmentation; panoptic segmentation; domain generalization;
D O I
10.1109/TPAMI.2022.3151200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images, requiring more than 1.34 years of collective annotator effort. The resulting composite dataset enables training a single semantic segmentation model that functions effectively across domains and generalizes to datasets that were not seen during training. We adopt zero-shot cross-dataset transfer as a benchmark to systematically evaluate a model's robustness and show that MSeg training yields substantially more robust models in comparison to training on individual datasets or naive mixing of datasets without the presented contributions. A model trained on MSeg ranks first on the WildDash-v1 leaderboard for robust semantic segmentation, with no exposure to WildDash data during training. We evaluate our models in the 2020 Robust Vision Challenge (RVC) as an extreme generalization experiment. MSeg training sets include only three of the seven datasets in the RVC; more importantly, the evaluation taxonomy of RVC is different and more detailed. Surprisingly, our model shows competitive performance and ranks second. To evaluate how close we are to the grand aim of robust, efficient, and complete scene understanding, we go beyond semantic segmentation by training instance segmentation and panoptic segmentation models using our dataset. Moreover, we also evaluate various engineering design decisions and metrics, including resolution and computational efficiency. Although our models are far from this grand aim, our comprehensive evaluation is crucial for progress. We share all the models and code with the community.
引用
收藏
页码:796 / 810
页数:15
相关论文
共 50 条
  • [21] Multi-domain adaptation for cross-domain semantic slot filling
    Zhang, Yuhui
    Chen, Li
    Ju, Shenggen
    Liu, Gaoshuo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [22] WikiAsp: A Dataset for Multi-domain Aspect-based Summarization
    Hayashi, Hiroaki
    Budania, Prashant
    Wang, Peng
    Ackerson, Chris
    Neervannan, Raj
    Neubig, Graham
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 (09) : 211 - 225
  • [23] MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset
    Liu, Yahui
    Yang, Haoping
    Gong, Chen
    Xia, Qingrong
    Li, Zhenghua
    Zhang, Min
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1707 - 1717
  • [24] Diffusion and Multi-Domain Adaptation Methods for Eosinophil Segmentation
    Lin, Kevin
    Brown, Donald
    Syed, Sana
    Greene, Adam
    PROCEEDINGS OF THE 2024 THE 7TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, ICMVA 2024, 2024, : 150 - 153
  • [25] Person Foreground Segmentation by Learning Multi-Domain Networks
    Liang, Zhiyuan
    Guo, Kan
    Li, Xiaobo
    Jin, Xiaogang
    Shen, Jianbing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 585 - 597
  • [26] Adversarial Continual Learning for Multi-domain Hippocampal Segmentation
    Memmel, Marius
    Gonzalez, Camila
    Mukhopadhyay, Anirban
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND AFFORDABLE HEALTHCARE AND AI FOR RESOURCE DIVERSE GLOBAL HEALTH (DART 2021), 2021, 12968 : 35 - 45
  • [27] MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving
    Li, Jiale
    Dai, Hang
    Han, Hao
    Ding, Yong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21694 - 21704
  • [28] Is Semantic Communication Secure? A Tale of Multi-Domain Adversarial Attacks
    Sagduyu, Yalin E.
    Erpek, Tugba
    Ulukus, Sennur
    Yener, Aylin
    IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (11) : 50 - 55
  • [29] MENSA: Multi-Dataset Harmonized Pretraining for Semantic Segmentation
    Shi, Bowen
    Zhang, Xiaopeng
    Wang, Yaoming
    Dai, Wenrui
    Zou, Junni
    Xiong, Hongkai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2127 - 2140
  • [30] Linking semantic and knowledge representations in a multi-domain dialogue system
    Dzikovska, Myroslava O.
    Allen, James F.
    Swift, Mary D.
    JOURNAL OF LOGIC AND COMPUTATION, 2008, 18 (03) : 405 - 430