Segments Graph-Based Approach for Smartphone Document Capture

被引:1
|
作者
Zhukovsky, Alexander E. [1 ]
Arlazarov, Vladimir V. [2 ]
Postnikov, Vasiliy V. [2 ]
Krivtsov, Valeriy E. [1 ]
机构
[1] Moscow Inst Phys & Technol, Moscow, Russia
[2] Russian Acad Sci, Fed Res Ctr Informat & Control Syst, Inst Syst Anal, Moscow 117901, Russia
关键词
Mobile capture; document capture; segment detection; blackbox optimization; projective transform;
D O I
10.1117/12.2228719
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document capture with a smartphone camera is already here to stay. Interactive applications for document capture and its enhancement have filled mobile application stores. However, discounting the predictions and judging only from the experience of using such applications, they are not yet ready to compete with stationary scanners when high quality and reliability is required. This paper is devoted to analysis of the problem of document detection in the image and evaluation of the quality of existing mobile applications. Based on this analysis we present a new reliable algorithm for document capture, based on the boundary segments detection and constructing a segments graph to fit rectangular projective model. The algorithm achieves about 95% quality of document detection and outperforms all of the reviewed algorithms, implemented in mobile applications.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Segments Graph-Based Approach for Document Capture in a Smartphone Video Stream
    Zhukovsky, Alexander
    Nikolaev, Dmitriy
    Arlazarov, Vladimir
    Postnikov, Vasiliy
    Polevoy, Dmitriy
    Skoryukina, Natalya
    Chernov, Timofey
    Shemiakina, Julia
    Mukovozov, Arseniy
    Konovalenko, Ivan
    Povolotsky, Mikhail
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 337 - 342
  • [2] Semantic Graph-Based Approach for Document Organization
    Velazquez-Garcia, Erika
    Lopez-Arevalo, Ivan
    Sosa-Sosa, Victor
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2012, 151 : 469 - 476
  • [3] GRAPH-BASED APPROACH FOR MOTION CAPTURE DATA REPRESENTATION AND ANALYSIS
    Kao, Jiun-Yu
    Ortega, Antonio
    Narayanan, Shrikanth S.
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2061 - 2065
  • [4] Efficient Graph-Based Document Similarity
    Paul, Christian
    Rettinger, Achim
    Mogadala, Aditya
    Knoblock, Craig A.
    Szekely, Pedro
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 334 - 349
  • [5] NERank plus : a graph-based approach for entity ranking in document collections
    Wang, Chengyu
    Zhou, Guomin
    He, Xiaofeng
    Zhou, Aoying
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (03) : 504 - 517
  • [6] NERank+: a graph-based approach for entity ranking in document collections
    Chengyu Wang
    Guomin Zhou
    Xiaofeng He
    Aoying Zhou
    Frontiers of Computer Science, 2018, 12 : 504 - 517
  • [7] GVdoc: Graph-based Visual Document Classification
    Mohbat, Fnu
    Zaki, Mohammed J.
    Finegan-Dollak, Catherine
    Verma, Ashish
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 5342 - 5357
  • [8] A graph-based framework for Web document mining
    Schenker, A
    Bunke, H
    Last, M
    Kandel, A
    DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 401 - 412
  • [9] Graph-based Document Representation for Relation Extraction
    Cabaleiro, Bernardo
    Penas, Anselmo
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 57 - 64
  • [10] Knowledge-Oriented Graph-Based Approach to Capture the Evolution of Developers' Knowledge
    Lucas, Edson
    Oliveira, Toacy
    Alencar, Paulo
    Cowan, Donald
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4609 - 4612