Author name recognition in degraded journal images

Cited by: 0
Authors
de la Jacopiere, Aliette de Bodard [1]
Likforman-Sulem, Laurence [1]
Affiliations
[1] GET Ecole Natl Super Telecommun, Signal & Image Proc Dept, 46 Rue Barrault, F-75013 Paris, France
Source
Keywords
degraded documents; journal title pages; neural network; author name; textual and image-based analysis
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A method for extracting names in degraded documents is presented in this article. The documents targeted are images of photocopied scientific journals from various scientific domains. Because of the degradation, OCR recognition is poor, and pieces of other articles appear on the sides of the image. The proposed approach combines a low-level textual analysis with an image-based analysis. The textual analysis extracts robust typographic features, while the image analysis selects image regions of interest through anchor components. We report results on the University of Washington benchmark database.
Pages: 9