VisualDiff: Document Image Verification and Change Detection

被引:5
|
作者
Jain, Rajiv [1 ]
Doermann, David [1 ]
机构
[1] Univ Maryland, Language & Multimedia Proc Lab, College Pk, MD 20742 USA
关键词
Change Detection; Document Image; Document Verification;
D O I
10.1109/ICDAR.2013.17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the related problems of verification and change detection in document images. The goal is to determine if two document images differ, and if so, to determine precisely what content may have been added, deleted, or otherwise modified. This problem has many potential applications, especially for important legal documents such as contractual agreements. These agreements are often edited, shared and stored as scanned or hardcopy documents, where small, undetected changes between edits could create major differences in the contractual language and thus have severe repercussions. One can view the problem of change detection as tracing the revision history of a set of documents. Thus, in order to validate the performance of this approach, we created the "Enron Revisions" dataset. This dataset contains realistic revisions obtained from attachments in the Enron Corpus, and a series of before and after snapshots of the revisions in images with varying levels of noise from resolution, binarization, and blur. The approach taken in this paper utilizes the SIFT descriptor to align two document images without the benefit of OCR and once aligned, to compare dense descriptors to determine changes that have occurred within the image. As a baseline, this "VisualDiff" is compared to a UNIX diff-like approach on text extracted through OCR and results demonstrate the effectiveness of this approach.
引用
收藏
页码:40 / 44
页数:5
相关论文
共 50 条
  • [1] Localized Document Image Change Detection
    Jain, Rajiv
    Doermann, David
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 786 - 790
  • [2] Document image analysis and verification using cursive signature
    Chalechale, A
    Naghdy, G
    Premaratne, P
    Mertins, A
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 887 - 890
  • [3] The detection of duplicates in document image databases
    Doermann, D
    Li, HP
    Kia, O
    IMAGE AND VISION COMPUTING, 1998, 16 (12-13) : 907 - 920
  • [4] The detection of duplicates in document image databases
    Doermann, D
    Li, HP
    Kia, O
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 314 - 318
  • [5] Document image similarity and equivalence detection
    Jonathan J. Hull
    International Journal on Document Analysis and Recognition, 1998, 1 (1) : 37 - 42
  • [6] Document image similarity and equivalence detection
    Hull, JJ
    Cullen, JF
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 308 - 312
  • [7] Document image orientation detection based on both text and image
    Sun, Yuejia
    Liu, Changsong
    Ding, Xiaoqing
    Fan, Zhigang
    Tse, Francis
    IMAGING AND PRINTING IN A WEB 2.0 WORLD III, 2012, 8302
  • [8] Signature Detection and Matching for Document Image Retrieval
    Zhu, Guangyu
    Zheng, Yefeng
    Doermann, David
    Jaeger, Stefan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (11) : 2015 - 2031
  • [9] DOCUMENT CHANGE DETECTION WITH HIERARCHICAL PATCH COMPARISON
    Park, Doyoung
    Kim, Sunjin
    Kim, Minkyu
    Yarram, Naresh Reddy
    Joe, Seongho
    Gwon, Youngjune
    Choi, Jongwon
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 665 - 669
  • [10] DIGITAL IMAGE CHANGE DETECTION
    FREI, W
    SINGH, M
    SHIBATA, T
    OPTICAL ENGINEERING, 1980, 19 (03) : 331 - 338