MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation

被引:5
|
作者
Lambert, John [1 ]
Liu, Zhuang [2 ]
Sener, Ozan [3 ]
Hays, James [1 ]
Koltun, Vladlen [3 ]
机构
[1] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA
[2] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA
[3] Intel Labs, Santa Clara, CA 95054 USA
关键词
Training; Semantics; Computational modeling; Annotations; Taxonomy; Image segmentation; Benchmark testing; Robust vision; semantic segmentation; instance segmentation; panoptic segmentation; domain generalization;
D O I
10.1109/TPAMI.2022.3151200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images, requiring more than 1.34 years of collective annotator effort. The resulting composite dataset enables training a single semantic segmentation model that functions effectively across domains and generalizes to datasets that were not seen during training. We adopt zero-shot cross-dataset transfer as a benchmark to systematically evaluate a model's robustness and show that MSeg training yields substantially more robust models in comparison to training on individual datasets or naive mixing of datasets without the presented contributions. A model trained on MSeg ranks first on the WildDash-v1 leaderboard for robust semantic segmentation, with no exposure to WildDash data during training. We evaluate our models in the 2020 Robust Vision Challenge (RVC) as an extreme generalization experiment. MSeg training sets include only three of the seven datasets in the RVC; more importantly, the evaluation taxonomy of RVC is different and more detailed. Surprisingly, our model shows competitive performance and ranks second. To evaluate how close we are to the grand aim of robust, efficient, and complete scene understanding, we go beyond semantic segmentation by training instance segmentation and panoptic segmentation models using our dataset. Moreover, we also evaluate various engineering design decisions and metrics, including resolution and computational efficiency. Although our models are far from this grand aim, our comprehensive evaluation is crucial for progress. We share all the models and code with the community.
引用
收藏
页码:796 / 810
页数:15
相关论文
共 50 条
  • [41] Semantic Vector Space Mapping for Edge of Network Multi-Domain Operations
    Bent, Graham
    Summers-Stay, Douglas
    Preece, Alun
    Li, Yuhua
    Davies, Lewys
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [42] Fuzzy Semantic Classification of Multi-Domain E-Learning Concept
    Ahmed, Rafeeq
    Ahmad, Tanvir
    Almutairi, Fadiyah M.
    Qahtani, Abdulrahman M.
    Alsufyani, Abdulmajeed
    Almutiry, Omar
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (05): : 2206 - 2215
  • [43] Ticino: A multi-modal remote sensing dataset for semantic segmentation
    Barbato, Mirko Paolo
    Piccoli, Flavio
    Napoletano, Paolo
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [44] MULTI-DOMAIN RETRIEVAL OF GEOSPATIAL DATA SOURCES IMPLEMENTING A SEMANTIC CATALOGUE
    Romeo Vizcarra, Julio
    Cruz, Christophe
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY (IE 2015): EDUCATION, RESEARCH & BUSINESS TECHNOLOGIES, 2015, : 582 - 586
  • [45] Fuzzy Semantic Classification of Multi-Domain E-Learning Concept
    Rafeeq Ahmed
    Tanvir Ahmad
    Fadiyah M. Almutairi
    Abdulrahman M. Qahtani
    Abdulmajeed Alsufyani
    Omar Almutiry
    Mobile Networks and Applications, 2021, 26 : 2206 - 2215
  • [46] Auto-generated Wires Dataset for Semantic Segmentation with Domain-Independence
    Zanella, Riccardo
    Caporali, Alessio
    Tadaka, Kalyan
    De Gregorio, Daniele
    Palli, Gianluca
    2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021, : 292 - 298
  • [47] MCER: A Multi-domain Dataset for Sentence-Level Chinese Ellipsis Resolution
    Qi, Jialu
    Shao, Yanqiu
    Li, Wei
    Shen, Zizhuo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 29 - 42
  • [48] A Composite Noise Removal Network Based on Multi-domain Adaptation
    Bai, Fan
    Li, Pengfei
    Sun, Haoyang
    Zhang, Hui
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 1194 - 1205
  • [49] A pothole video dataset for semantic segmentation
    Ihsan, Muhammad
    Amrizal, Muhammad Alfian
    Harjoko, Agus
    DATA IN BRIEF, 2024, 53
  • [50] An enhanced sentiment dictionary for domain adaptation with multi-domain dataset in Tamil language (ESD-DA)
    Sivasankar, E.
    Krishnakumari, K.
    Balasubramanian, P.
    SOFT COMPUTING, 2021, 25 (05) : 3697 - 3711