Automated document content characterization for a multimedia document retrieval system

被引:0
|
作者
Koivusaari, M
Sauvola, J
Pietikainen, M
机构
关键词
document layout analysis; predictive coding; document database; retrieval; document content characterization; object-oriented database;
D O I
10.1117/12.290337
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We propose a new approach to automate document image layout extraction for an object-oriented database feature population using rapid low level feature analysis, preclassification and predictive coding. The layout information comprised of region location and classification data is transformed into 'feature object(s)'. The information is then fed into an intelligent document image retrieval system (IDIR) to be utilized in document retrieval schemes. The IDIR system consists of user interface, object-oriented database and a variety of document image analysis algorithms. In this paper the object-oriented storage model and the database system are presented in formal and functional domains. Moreover, the graphical user interface and a visual document image browser are described. The document analysis techniques used at document characterization are also presented. In this context the documents consist of text, picture and other media (possibly embedded) data. Documents are stored in the database as document, page and region objects. Our test system has been implemented and tested using a document database of 10 000 documents.
引用
收藏
页码:148 / 159
页数:12
相关论文
共 50 条
  • [1] Multimedia document retrieval
    Ozkarahan, Esen, 1600, Pergamon Press Inc, Tarrytown, NY, United States (31):
  • [2] A design and implementation of multimedia document retrieval system
    Lee, SH
    Yoo, CG
    Hwang, CJ
    Jang, DM
    COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION - INTELLIGENT IMAGE PROCESSING, DATA ANALYSIS & INFORMATION RETRIEVAL, 1999, 56 : 278 - 283
  • [3] MULTIMEDIA DOCUMENT-RETRIEVAL
    OZKARAHAN, E
    INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (01) : 113 - 131
  • [4] The Cambridge University multimedia document retrieval demo system
    Tuerk A.
    Johnson S.E.
    Jourlin P.
    Jones K.S.
    Woodland P.C.
    International Journal of Speech Technology, 2001, 4 (3-4) : 241 - 250
  • [5] Automated document conversion system for simple multimedia platforms
    Martinez-Alvarez, R. P.
    Costas-Rodriguez, S.
    Gonzalez-Castano, F. J.
    Gil-Castineira, F.
    2010 7TH IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE-CCNC 2010, 2010, : 235 - +
  • [6] Automated scientific document retrieval
    Kaur, Jaspal
    Yusof, Mohammad
    Boursier, Patrice
    Ogier, Jean-Marc
    2010 The 2nd International Conference on Computer and Automation Engineering, ICCAE 2010, 2010, 5 : 732 - 736
  • [7] Automated Scientific Document Retrieval
    Kaur, Jaspal
    Yusof, Mohammad
    Boursier, Patrice
    Ogier, Jean-Marc
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 5, 2010, : 732 - 736
  • [8] AN OPEN FRAMEWORK FOR A MULTIMEDIA MEDICAL DOCUMENT SYSTEM (A MULTIMEDIA DOCUMENT SYSTEM FRAMEWORK)
    IP, HHS
    LAW, KCK
    CHAN, SL
    JOURNAL OF MICROCOMPUTER APPLICATIONS, 1995, 18 (03): : 215 - 232
  • [9] Document content inventory & retrieval
    Moll, Michael A.
    Baird, Henry S.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 93 - 97
  • [10] XML document retrieval system based on document structure and image content for digital museum
    Chang, JW
    Kim, YJ
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 107 - 111