Eigenspace method for text retrieval in historical document images

被引:0
|
作者
Terasawa, K [1 ]
Nagasaki, T [1 ]
Kawashima, T [1 ]
机构
[1] Future Univ Hakodate, Sch Syst Informat Sci, Hakodate, Hokkaido 0418655, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new method for text retrieval that does not need segmentation is described. Segmenting the images in historical documents into individual characters is difficult. Therefore, the conventional OCR method, which uses segmentation, does not work well. Our method instead divides the text image into a sequence of small slits. The image region that corresponds to the query image region is retrieved by solving the matching problem of these sequences. Applying the eigenspace method to the slit images enables us to solve the matching problem efficiently. Moreover using dynamic time warping (DTW) further improves the results. Our method has higher accuracy than the simple template matching method, and it has far higher efficiency in computational cost.
引用
收藏
页码:437 / 441
页数:5
相关论文
共 50 条
  • [1] An effective method for text line segmentation in historical document images
    Tien-Nam Nguyen
    Burie, Jean-Christophe
    Thi-Lan Le
    Schweyer, Anne-Valerie
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1593 - 1599
  • [2] Text line extraction for historical document images
    Saabni, Raid
    Asi, Abedelkadir
    El-Sana, Jihad
    PATTERN RECOGNITION LETTERS, 2014, 35 : 23 - 33
  • [3] Text segmentation in degraded historical document images
    Kavitha, A. S.
    Shivakumara, P.
    Kumar, G. H.
    Lu, Tong
    EGYPTIAN INFORMATICS JOURNAL, 2016, 17 (02) : 189 - 197
  • [4] VESSELNESS FOR TEXT DETECTION IN HISTORICAL DOCUMENT IMAGES
    Hofmann, Simon
    Gropp, Martin
    Bernecker, David
    Pollin, Christopher
    Maier, Andreas
    Christlein, Vincent
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3259 - 3263
  • [5] Text extraction method for historical Tibetan document images based on block projections
    Duan L.-J.
    Zhang X.-Q.
    Ma L.-L.
    Wu J.
    Optoelectronics Letters, 2017, 13 (6) : 457 - 461
  • [6] Text extraction method for historical Tibetan document images based on block projections
    段立娟
    张西群
    马龙龙
    吴健
    OptoelectronicsLetters, 2017, 13 (06) : 457 - 461
  • [7] Adaptive color document images binarization for text retrieval
    Yi, L
    Wang, ZY
    Zeng, HZ
    DOCUMENT REGOGNITION AND RETRIEVAL XI, 2004, 5296 : 35 - 44
  • [8] A keyword retrieval system for historical Mongolian document images
    Wei, Hongxi
    Gao, Guanglai
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (01) : 33 - 45
  • [9] Visual information retrieval from historical document images
    Zhalehpour, Sara
    Arabnejad, Ehsan
    Wellmon, Chad
    Piper, Andrew
    Cheriet, Mohamed
    JOURNAL OF CULTURAL HERITAGE, 2019, 40 : 99 - 112
  • [10] A keyword retrieval system for historical Mongolian document images
    Hongxi Wei
    Guanglai Gao
    International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 33 - 45