Content-Based Document Image Retrieval Based on Document Modeling

被引:0
|
作者
Chwan-Yi Shiah
机构
[1] Fo Guang University,Department of Applied Informatics
关键词
Document modeling; Language model; Document image retrieval; Multinomial distribution; -gram model;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, language models have gained importance in the field of information retrieval. In this paper, we propose a generic language model to improve a content-based document retrieval system. In this approach, character images are extracted, clustered, and analyzed to form high-level semantic terms using a statistical document model. This model simulates the long-term relationships between characters. Documents are then indexed according to these terms, and a query document is proposed to retrieve the relevant documents. The query document can be a single keyword, or it can be synthesized from a text string. The aim is to generate a semantic representation from low-level image pixels through pattern matching and document modeling. The conventional approach of generating semantic terms in document retrieval includes every possible symbol sequence in the feature representation. Comparatively, our approach can considerably reduce the dimensions of the feature space while producing retrieval results comparable to those of the conventional and state-of-the-art approaches.
引用
收藏
页码:287 / 306
页数:19
相关论文
共 50 条
  • [31] Content-based image and video retrieval
    Vasconcelos, N
    SIGNAL PROCESSING, 2005, 85 (02) : 231 - 232
  • [32] A new content-based image retrieval
    Zhang, Zhen-Hua
    Quan, Yong
    Li, Wen-Hui
    Guo, Wu
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 4013 - +
  • [33] Content-Based Image Retrieval Research
    Duan, Guoyong
    Yang, Jing
    Yang, Yilong
    2011 INTERNATIONAL CONFERENCE ON PHYSICS SCIENCE AND TECHNOLOGY (ICPST), 2011, 22 : 471 - 477
  • [34] Faceted content-based image retrieval
    Amato, Giuseppe
    Meghini, Carlo
    DEXA 2008: 19TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, : 402 - 406
  • [35] Content-based image retrieval with WISFC
    Zhang, H. (guwenjiao1989@126.com), 1600, Binary Information Press (10):
  • [36] Prefetching for content-based image retrieval
    Yoon, J
    Jayant, N
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A413 - A416
  • [37] Content-based ultrasound image retrieval
    Kwak, DM
    Kim, BS
    Park, CH
    Kim, SJ
    Kim, YM
    Park, KH
    METMBS'01: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2001, : 512 - 517
  • [38] Content-based image retrieval - A survey
    Choras, Ryszard S.
    BIOMETRICS, COMPUTER SECURITY SYSTEMS AND ARTIFICIAL INTELLIGENCE APPLICATIONS, 2006, : 31 - 44
  • [39] Content-Based Histopathological Image Retrieval
    Nunez-Fernandez, Camilo
    Farias, Humberto
    Solar, Mauricio
    SENSORS, 2025, 25 (05)
  • [40] Study on Content-Based of Image Retrieval
    Zhang, Chi
    Huang, Lei
    LISS 2013, 2015, : 591 - 594