Exemplary documents: a foundation for information retrieval design

被引:15
|
作者
Blair, DC [1 ]
Kimbrough, SO
机构
[1] Univ Michigan, Grad Sch Business, Ann Arbor, MI 48109 USA
[2] Univ Penn, Wharton Sch, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Associative processing - Indexing (of information) - Libraries - Text processing;
D O I
10.1016/S0306-4573(01)00027-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Documents are generally represented for retrieval by either extracting index terms from them or by creating and selecting from an external set of candidate terms. There are many procedures for doing this, but while work continues along these dimensions, there have been relatively few attempts to change this basic process. Of particular importance is the creation of indexing schemes for retrieval systems in non-library contexts. Here, the cost of developing an indexing scheme independent of the documents to be retrieved is often considered too high to implement. As a result, simple full-text retrieval or, to a lesser extent, automatic extractive or associative indexing methods are the predominant methods used in non-library contexts. This paper suggests an alternative document representation method based on what we call exemplary documents. Exemplary documents are those documents that describe or exhibit the intellectual structure of a particular field of interest. In so doing, they provide both an indexing vocabulary for that area and, more importantly, a narrative context in which the indexing terms have a clearer meaning. Further, it is much easier to develop an indexing scheme by using exemplary documents than it is to do so from scratch. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:363 / 379
页数:17
相关论文
共 50 条
  • [1] Design and implementation of a structured information retrieval system for SGML documents
    Han, SG
    Son, JH
    Chang, JW
    Zhoo, ZC
    6TH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 1999, : 81 - 88
  • [2] Design and implementation of automatic indexing for information retrieval with Arabic documents
    Hmeidi, I
    Kanaan, G
    Evens, M
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1997, 48 (10): : 867 - 881
  • [3] INFORMATION RETRIEVAL FOR SHORT DOCUMENTS
    Qi Haoliang Li Mu Gao Jianfeng Li Sheng Ministry of Education Microsoft Key Laboratory of Natural Language Processing and Speech Harbin Institute of Technology Harbin China Microsoft Research Asia Beijing China Microsoft Research Redmond WA USA
    Journal of Electronics, 2006, (06) : 933 - 936
  • [4] ANNOTATIONS ON DOCUMENTS FOR INFORMATION RETRIEVAL
    Patil, Vishal A.
    Khambre, Pankaj
    2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [5] INFORMATION RETRIEVAL FOR SHORT DOCUMENTS
    Qi Haoliang Li Mu* Gao Jianfeng** Li Sheng (Ministry of Education - Microsoft Key Laboratory of Natural Language Processing and Speech (Harbin Institute of Technology)
    Journal of Electronics(China), 2006, (06) : 933 - 936
  • [6] Information retrieval from spoken documents
    Fapso, M
    Smrz, P
    Schwarz, P
    Szöke, I
    Schwarz, M
    Cernocky, J
    Karafiát, M
    Burget, L
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 410 - 416
  • [7] Considering Documents in Lifelog Information Retrieval
    Gupta, Rashmi
    ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 497 - 500
  • [8] Documents, data, information retrieval, & XML
    Fichter, D
    Cervone, F
    ONLINE, 2000, 24 (06): : 30 - +
  • [9] Flexible information retrieval on XML documents
    Grabs, T
    Schek, HJ
    INTELLIGENT SEARCH ON XML DATA: APPLICATIONS, LANGUAGES, MODELS IMPLEMENTATIONS AND BENCHMARKS, 2003, 2818 : 95 - 106
  • [10] Information retrieval system for handwritten documents
    Srihari, S
    Ganesh, A
    Tomai, C
    Shin, YC
    Huang, C
    DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 298 - 309