Word and sentence extraction using irregular pyramid

被引:0
|
作者
Loo, PK [1 ]
Tan, CL
机构
[1] Singapore Polytech, Sch Built Environm & Design, Singapore 139651, Singapore
[2] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the result of our continued work on a further enhancement to our previous proposed algorithm. Moving beyond the extraction of word groups and based on the same irregular pyramid structure the new proposed algorithm groups the extracted words into sentences. The uniqueness of the algorithm is in its ability to process text of a wide variation in terms of size, font, orientation and layout on the same document image. No assumption is made on any specified document type. The algorithm is based on the irregular pyramid structure with the application. of four fundamental concepts. The first is the inclusion of background information. The second is the concept of closeness where text information within a group is close to each other, in terms of spatial distance, as compared to other text areas. The third is the "majority win" strategy that is more suitable under the greatly varying environment than a constant threshold value. The final concept is the uniformity and continuity among words belonging to the same sentence.
引用
收藏
页码:307 / 318
页数:12
相关论文
共 50 条
  • [21] Perception-based image segmentation using the Bounded Irregular Pyramid
    Marfil, Rebeca
    Bandera, Antonio
    Sandoval, Francisco
    PATTERN RECOGNITION, PROCEEDINGS, 2007, 4713 : 244 - +
  • [22] 3D Image Segmentation Using the Bounded Irregular Pyramid
    Torres, Fuensanta
    Marfil, Rebeca
    Bandera, Antonio
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2009, 5702 : 979 - 986
  • [23] Unsupervised Relation Extraction Using Sentence Encoding
    Ali, Manzoor
    Saleem, Muhammad
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB: ESWC 2021 SATELLITE EVENTS, 2021, 12739 : 136 - 140
  • [24] Alternative strategies for irregular pyramid construction
    Ip, HHS
    Lam, SWC
    IMAGE AND VISION COMPUTING, 1996, 14 (04) : 297 - 303
  • [25] WORD PERCEPTION AND MISPERCEPTION IN WORD AND SENTENCE CONTEXT
    POTTER, MC
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1989, 27 (06) : 489 - 489
  • [26] AN IRREGULAR HORN SENTENCE IN SUBMODULE LATTICES
    CZEDLI, G
    HUTCHINSON, G
    ACTA SCIENTIARUM MATHEMATICARUM, 1987, 51 (1-2): : 35 - 38
  • [27] Processing of Irregular Polysemes in Sentence Reading
    Brocher, Andreas
    Foraker, Stephani
    Koenig, Jean-Pierre
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2016, 42 (11) : 1798 - 1813
  • [28] Word and Word Order: From Word and Collocation to Sentence and Word Order
    Kacala, Jan
    ESLAVISTICA COMPLUTENSE, 2012, 12 : 87 - 95
  • [29] Improving Thai Word and Sentence Segmentation Using Linguistic Knowledge
    Nararatwong, Rungsiman
    Kertkeidkachorn, Natthawut
    Cooharojananone, Nagul
    Okada, Hitoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12): : 3218 - 3225
  • [30] Detecting ongoing events using contextual word and sentence embeddings
    Maisonnave, Mariano
    Delbianco, Fernando
    Tohme, Fernando
    Maguitman, Ana
    Milios, Evangelos
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 209