Automatic extraction of data from 2-D plots in documents

Authors
Lu, Xiaonan [1]
Wang, James Z. [1]
Mitra, Prasenjit [1]
Giles, C. Lee [1]
Affiliation
[1] Penn State Univ, University Pk, PA 16802 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and the performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely use the information contained in the plots to enhance the results returned in response to queries posed by end-users. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real-world use.
Pages: 188-192
Page count: 5