Searching and visualization of references in research documents

被引:0
|
作者
Nadirman, Firnas [1 ]
Ridha, Ahmad [2 ]
Annisa [2 ]
机构
[1] Agency for the Assessment and Application of Technology, Jl. M.H. Thamrin No. 8, Jakarta, 10340, Indonesia
[2] Department of Computer Science, Bogor Agricultural University, Kampus IPB Darmaga, Jl. Meranti Wing 20 Level 5-6, Bogor, 16680, Indonesia
关键词
AS graph - Author relationships - Automatic extraction - Paratools - PDF files - Search system - Text file - Writing guidelines;
D O I
10.12928/TELKOMNIKA.v12i2.2033
中图分类号
学科分类号
摘要
This research aims to develop a module for information retrieval that can trace references from bibliography entries of research documents, specifically those based on Bogor Agricultural University (IPB)'s writing guidelines. A total of 242 research documents in PDF from the Department of Computer Science IPB were used to generate parsing patterns to extract the bibliography entries. With modified ParaTools, automatic extraction of bibliography entries was performed on text files generated from the PDF files. The entries are stored in a database that is used to visualize author relationship as graphs. This module is supplemented by an information retrieval system based on Sphinx search system and also provides information of authors' publications and citations. Evaluation showed that (1) bibliography entry extraction missed only 5.37% bibliography entries caused by incorrect bibliography formatting, (2) 91.54% bibliography entry attributes could be identified correctly, and (3) 90.31% entries were successfully connected to other documents.
引用
收藏
页码:447 / 454
相关论文
共 50 条
  • [21] Searching for Physical Documents in Archival Repositories
    Suzuki, Tokinori
    Oard, Douglas W.
    Ishita, Emi
    Tomiura, Yoichi
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2614 - 2618
  • [22] A survey in indexing and searching XML documents
    Luk, RWP
    Leong, HV
    Dillon, TS
    Chan, ATS
    Croft, WB
    Allan, J
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (06): : 415 - 437
  • [23] Searching for beauty in data visualization
    Banks, Michael
    PHYSICS WORLD, 2014, 27 (04) : 10 - 10
  • [24] Chinese word searching in imaged documents
    Lu, Y
    Tan, CL
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (02) : 229 - 246
  • [25] Searching XML documents - Preliminary work
    Hassler, Marcus
    Bouchachia, Abdelhamid
    ADVANCES IN XML INFORMATION RETRIEVAL AND EVALUATION, 2006, 3977 : 119 - 133
  • [26] Efficient searching and retrieval of documents in PROSA
    Carchiolo, Vincenza
    Malgeri, Michele
    Mangioni, Giuseppe
    Nicosia, Vincenzo
    DATABASES, INFORMATION SYSTEMS, AND PEER-TO-PEER COMPUTING, 2007, 4125 : 298 - +
  • [27] Visualization of plagiarism detected in documents
    Mala, T.
    Geetha, T. V.
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL I, PROCEEDINGS, 2007, : 92 - 96
  • [28] MRCSI: Compressing and Searching String Collections with Multiple References
    Wandelt, Sebastian
    Leser, Ulf
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (05): : 461 - 472
  • [30] HOW BUSINESS DOCUMENTS AND REFERENCES CAN BE INDEXED EFFECTIVELY
    KUTTER, F
    NACHRICHTEN FUR DOKUMENTATION, 1957, 8 (03): : 132 - 134