Exemplary documents: a foundation for information retrieval design

被引:15
|
作者
Blair, DC [1 ]
Kimbrough, SO
机构
[1] Univ Michigan, Grad Sch Business, Ann Arbor, MI 48109 USA
[2] Univ Penn, Wharton Sch, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Associative processing - Indexing (of information) - Libraries - Text processing;
D O I
10.1016/S0306-4573(01)00027-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Documents are generally represented for retrieval by either extracting index terms from them or by creating and selecting from an external set of candidate terms. There are many procedures for doing this, but while work continues along these dimensions, there have been relatively few attempts to change this basic process. Of particular importance is the creation of indexing schemes for retrieval systems in non-library contexts. Here, the cost of developing an indexing scheme independent of the documents to be retrieved is often considered too high to implement. As a result, simple full-text retrieval or, to a lesser extent, automatic extractive or associative indexing methods are the predominant methods used in non-library contexts. This paper suggests an alternative document representation method based on what we call exemplary documents. Exemplary documents are those documents that describe or exhibit the intellectual structure of a particular field of interest. In so doing, they provide both an indexing vocabulary for that area and, more importantly, a narrative context in which the indexing terms have a clearer meaning. Further, it is much easier to develop an indexing scheme by using exemplary documents than it is to do so from scratch. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:363 / 379
页数:17
相关论文
共 50 条
  • [21] Algebraic Modeling of Information Retrieval in XML Documents
    Georgiev, Bozhidar
    Georgieva, Adriana
    APPLICATIONS OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE '09), 2009, 1184 : 318 - +
  • [22] Intelligent support for information retrieval of web documents
    Koval, R
    Návrat, P
    COMPUTING AND INFORMATICS, 2002, 21 (05) : 509 - 528
  • [23] Semantic Proximity in Information Retrieval and Documents Classification
    Vishnyakov, Yury
    Vishnyakov, Renat
    14TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2013, : 131 - 134
  • [24] Foundation Models for Information Retrieval and Knowledge Processing
    Shang, Shuo
    Jiang, Renhe
    Shibasaki, Ryosuke
    Yan, Rui
    DATA INTELLIGENCE, 2024, 6 (04) : 891 - 892
  • [25] Foundation Models for Information Retrieval and Knowledge Processing
    Shuo Shang
    Renhe Jiang
    Ryosuke Shibasaki
    Rui Yan
    Data Intelligence, 2024, 6 (04) : 891 - 892
  • [26] Information retrieval design
    Ng, Kwong Bor
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (03) : 822 - 824
  • [27] Multilingual and multimedia Information Retrieval from Web documents
    Gatius, M
    Bertran, M
    Rodriguez, H
    15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 20 - 24
  • [28] Further Investigations for Documents Information Retrieval Based on DWT
    Dahab, Mohamed Yehia
    Kamel, Mahmoud
    Alnofaie, Sara
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 3 - 11
  • [29] Arabic Information Retrieval Using Semantic Analysis of Documents
    Al-Maghasbeh, Mohammad Khaled A.
    Bin Hamzah, Mohd Pouzi
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (05): : 53 - 58
  • [30] Information Retrieval Strategies for Digitized Handwritten Medieval Documents
    Naji, Nada
    Savoy, Jacques
    INFORMATION RETRIEVAL TECHNOLOGY, 2011, 7097 : 103 - 114