Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation

被引:0
|
作者
Quang Nguyen [1 ,2 ]
Truong Vu [1 ]
Anh Tran [1 ]
Khoi Nguyen [1 ]
机构
[1] VinAI Res, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Technol, VNU HCM, Ho Chi Minh City, Vietnam
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category labels, we propose a novel method for generating pixel-level semantic segmentation labels using the text-to-image generative model Stable Diffusion (SD). By utilizing the text prompts, cross-attention, and self-attention of SD, we introduce three new techniques: class-prompt appending, class-prompt cross-attention, and self-attention exponentiation. These techniques enable us to generate segmentation maps corresponding to synthetic images. These maps serve as pseudo-labels for training semantic segmenters, eliminating the need for labor-intensive pixel-wise annotation. To account for the imperfections in our pseudo-labels, we incorporate uncertainty regions into the segmentation, allowing us to disregard loss from those regions. We conduct evaluations on two datasets, PASCAL VOC and MSCOCO, and our approach significantly outperforms concurrent work. Our benchmarks and code will be released at https://github.com/VinAIResearch/Dataset-Diffusion.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Lightweight pixel-level semantic segmentation and analysis for sewer defects using deep learning
    Dang, L. Minh
    Wang, Hanxiang
    Li, Yanfen
    Nguyen, Le Quan
    Nguyen, Tan N.
    Song, Hyoung-Kyu
    Moon, Hyeonjoon
    CONSTRUCTION AND BUILDING MATERIALS, 2023, 371
  • [32] Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation
    Du, Ye
    Fu, Zehua
    Liu, Qingjie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4654 - 4669
  • [33] Weakly Supervised Semantic Segmentation in Aerial Imagery via Explicit Pixel-Level Constraints
    Zhou, Ruixue
    Zhang, Wenkai
    Yuan, Zhiqiang
    Rong, Xuee
    Liu, Wenjie
    Fu, Kun
    Sun, Xian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [34] Generation and study of the synthetic brain electron microscopy dataset for segmentation purpose
    Sokolov, N. A.
    Vasiliev, E. P.
    Getmanskaya, A. A.
    COMPUTER OPTICS, 2023, 47 (05) : 778 - 787
  • [35] Diffusion-Based Data Augmentation for Nuclei Image Segmentation
    Yu, Xinyi
    Li, Guanbin
    Lou, Wei
    Liu, Siqi
    Wan, Xiang
    Chen, Yan
    Li, Haofeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 592 - 602
  • [36] TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection
    Yang, Wenkui
    Zhang, Zhida
    Zhou, Xiaogiang
    Duan, Junxian
    Cao, Jie
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 429 - 443
  • [37] Pixel-level damage detection for concrete spalling and rebar corrosion based on U-net semantic segmentation
    Xu, Y.
    Qiao, W. D.
    Bao, Y. Q.
    Li, H.
    Zhang, Y. F.
    BRIDGE MAINTENANCE, SAFETY, MANAGEMENT, LIFE-CYCLE SUSTAINABILITY AND INNOVATIONS, 2021, : 3319 - 3326
  • [38] A pixel-wise annotated dataset of small overlooked indoor objects for semantic segmentation applications
    Mohamed, Elhassan
    Sirlantzis, Konstantinos
    Howells, Gareth
    DATA IN BRIEF, 2022, 40
  • [39] Landslide detection based on pixel-level contrastive learning for semi-supervised semantic segmentation in wide areas
    Lv, Jichao
    Zhang, Rui
    Wu, Renzhe
    Bao, Xin
    Liu, Guoxiang
    LANDSLIDES, 2025, 22 (04) : 1087 - 1105
  • [40] MedWGAN based synthetic dataset generation for Uveitis pathology
    Sliman, Heithem
    Megdiche, Imen
    Alajramy, Loay
    Taweel, Adel
    Yangui, Sami
    Drira, Aida
    Lamine, Elyes
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18