Synthetic Document Images with Diverse Shadows for Deep Shadow Removal Networks

被引:0
|
作者
Matsuo, Yuhi [1 ]
Aoki, Yoshimitsu [1 ]
机构
[1] Keio Univ, Fac Sci & Technol, Dept Elect Engn, 3-14-1 Hiyoshi,Kohoku Ku, Yokohama, Kanagawa 2238522, Japan
关键词
shadow removal; document images; deep neural networks;
D O I
10.3390/s24020654
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Shadow removal for document images is an essential task for digitized document applications. Recent shadow removal models have been trained on pairs of shadow images and shadow-free images. However, obtaining a large, diverse dataset for document shadow removal takes time and effort. Thus, only small real datasets are available. Graphic renderers have been used to synthesize shadows to create relatively large datasets. However, the limited number of unique documents and the limited lighting environments adversely affect the network performance. This paper presents a large-scale, diverse dataset called the Synthetic Document with Diverse Shadows (SynDocDS) dataset. The SynDocDS comprises rendered images with diverse shadows augmented by a physics-based illumination model, which can be utilized to obtain a more robust and high-performance deep shadow removal network. In this paper, we further propose a Dual Shadow Fusion Network (DSFN). Unlike natural images, document images often have constant background colors requiring a high understanding of global color features for training a deep shadow removal network. The DSFN has a high global color comprehension and understanding of shadow regions and merges shadow attentions and features efficiently. We conduct experiments on three publicly available datasets, the OSR, Kligler's, and Jung's datasets, to validate our proposed method's effectiveness. In comparison to training on existing synthetic datasets, our model training on the SynDocDS dataset achieves an enhancement in the PSNR and SSIM, increasing them from 23.00 dB to 25.70 dB and 0.959 to 0.971 on average. In addition, the experiments demonstrated that our DSFN clearly outperformed other networks across multiple metrics, including the PSNR, the SSIM, and its impact on OCR performance.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] A Statistical Approach for Multi-frame Shadow Movement Detection and Shadow Removal for Document Capture
    Mondal, Prasenjit
    Bal, Ankit
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 508 - 512
  • [42] Structure-Informed Shadow Removal Networks
    Liu, Yuhao
    Guo, Qing
    Fu, Lan
    Ke, Zhanghan
    Xu, Ke
    Feng, Wei
    Tsang, Ivor W.
    Lau, Rynson W. H.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5823 - 5836
  • [43] Document images classification based on deep learning
    Hu, Biao
    Ergu, Daji
    Yang, Huan
    Liu, Kuiyi
    Cai, Ying
    7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 : 514 - 522
  • [44] Document Image Shadow Removal Guided by Color-Aware Background
    Zhang, Ling
    He, Yinghao
    Zhang, Qing
    Liu, Zheng
    Zhang, Xiaolong
    Xiao, Chunxia
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1818 - 1827
  • [45] Water-Filling: An Efficient Algorithm for Digitized Document Shadow Removal
    Jung, Seungjun
    Abul Hasan, Muhammad
    Kim, Changick
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 398 - 414
  • [46] Region growing shadow segmentation in Synthetic Aperture Radar images
    Wilson, KS
    Power, GJ
    CISST'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS, AND TECHNOLOGY, VOLS I AND II, 2000, : 37 - 41
  • [47] Synthetic Dataset of Electroluminescence Images of Photovoltaic Cells by Deep Convolutional Generative Adversarial Networks
    Mateo Romero, Hector Felipe
    Gonzalez Rebollo, Miguel Angel
    Cardenoso-Payo, Valentin
    Alonso Gomez, Victor
    Jose Bello, Hugo
    Redondo Plaza, Alberto
    Hernandez Callejo, Luis
    SMART CITIES, ICSC-CITIES 2022, 2023, 1706 : 3 - 16
  • [48] Synthetic Dataset of Electroluminescence Images of Photovoltaic Cells by Deep Convolutional Generative Adversarial Networks
    Mateo Romero, Hector Felipe
    Hernandez-Callejo, Luis
    Gonzalez Rebollo, Miguel Angel
    Cardenoso-Payo, Valentin
    Alonso Gomez, Victor
    Jose Bello, Hugo
    Moyo, Ranganai Tawanda
    Morales Aragones, Jose Ignacio
    SUSTAINABILITY, 2023, 15 (09)
  • [49] Automated detection and removal of clouds and their shadows from landsat TM images
    Wang, B
    Ono, A
    Muramatsu, K
    Fujiwara, N
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1999, E82D (02) : 453 - 460
  • [50] Circular noises removal from scanned document images
    Meng, Gaofeng
    Zheng, Nanning
    Zhang, Yuanlin
    Song, Yonghong
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 183 - 187