Capisco: low-cost concept-based access to digital libraries

被引:2
|
作者
Hinze, Annika [1 ]
Bainbridge, David [1 ]
Cunningham, Sally Jo [1 ]
Taube-Schock, Craig [1 ]
Matamua, Rangi [1 ]
Downie, J. Stephen [2 ]
Rasmussen, Edie [3 ]
机构
[1] Univ Waikato, Hamilton, New Zealand
[2] Univ Illinois, Urbana, IL 61801 USA
[3] Univ British Columbia, Vancouver, BC, Canada
基金
美国安德鲁·梅隆基金会;
关键词
Semantic analysis; Disambiguation; Indexing; Semantic enrichment; Metadata enrichment; QUERY EXPANSION; WIKIPEDIA; SYSTEM;
D O I
10.1007/s00799-018-0232-3
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
In this article, we present the conceptual design and report on the implementation of Capisco-a low-cost approach to concept-based access to digital libraries. Capisco avoids the need for complete semantic document markup using ontologies by leveraging an automatically generated Concept-in-Context (CiC) network. The network is seeded by a priori analysis of Wikipedia texts and identification of semantic metadata. Our Capisco system disambiguates the semantics of terms in the documents by their semantics and context and identifies the relevant CiC concepts. Supplementary to this, the disambiguation of search queries is done interactively, to fully utilize the domain knowledge of the scholar. For established digital library systems, completely replacing, or even making significant changes to the document retrieval mechanism (document analysis, indexing strategy, query processing, and query interface) would require major technological effort and would most likely be disruptive. In addition to presenting Capisco, we describe ways to harness the results of our developed semantic analysis and disambiguation, while retaining the existing keyword-based search and lexicographic index. We engineer this so the output of semantic analysis (performed off-line) is suitable for import directly into existing digital library metadata and index structures, and thus incorporated without the need for architecture modifications.
引用
收藏
页码:307 / 334
页数:28
相关论文
共 50 条
  • [1] Capisco: low-cost concept-based access to digital libraries
    Annika Hinze
    David Bainbridge
    Sally Jo Cunningham
    Craig Taube-Schock
    Rangi Matamua
    J. Stephen Downie
    Edie Rasmussen
    International Journal on Digital Libraries, 2019, 20 : 307 - 334
  • [2] Concept-based browsing in video libraries
    Hollfelder, S
    Everts, A
    Thiel, U
    IEEE FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES, PROCEEDINGS, 1999, : 105 - 115
  • [3] Concept-based information access
    Ozcan, M
    Aslandogan, YA
    ITCC 2005: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 1, 2005, : 794 - 799
  • [4] A HARDWARE CONCEPT FOR AN EXPANDABLE LOW-COST DIGITAL AUDIO MIXER
    SKRITEK, P
    PARTH, E
    POLLEROS, R
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1983, 31 (05): : 366 - 366
  • [5] The ADEPT concept-based Digital Learning Environment
    Smith, TR
    Ancona, D
    Buchel, O
    Freeston, M
    Heller, W
    Nottrott, R
    Tierney, T
    Ushakov, A
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2003, 2769 : 300 - 312
  • [6] A LOW-COST DIGITAL TESLAMETER
    TORZO, G
    SCONZA, A
    STORTI, R
    JOURNAL OF PHYSICS E-SCIENTIFIC INSTRUMENTS, 1987, 20 (03): : 260 - 262
  • [8] LOW-COST DIGITAL TELERADIOLOGY
    REPONEN, J
    LAHDE, S
    TERVONEN, O
    ILKKO, E
    RISSANEN, T
    SURAMO, I
    EUROPEAN JOURNAL OF RADIOLOGY, 1995, 19 (03) : 226 - 231
  • [9] LOW-COST DIGITAL SUBTRACTION
    FORD, KK
    HEINZ, ER
    JOHNSON, GA
    DRAYER, BP
    DUBOIS, PJ
    AMERICAN JOURNAL OF NEURORADIOLOGY, 1982, 3 (01) : 99 - 99
  • [10] LOW-COST DIGITAL VOLTMETERS
    不详
    ELECTRO-TECHNOLOGY, 1968, 81 (01): : 65 - &