Managing very large document collections using semantics

Cited by: 1
Authors
Wang, GR [1]
Lu, HJ
Yu, G
Bao, YB
Affiliations
[1] Northeastern Univ, Dept Comp Sci, Shenyang 110004, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
Keywords
semantic document; multidimensional exploring; document querying
DOI
10.1007/BF02948912
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a system in which documents are no longer identified by their file names. Instead, each document is represented by its semantics, expressed as a descriptor and a content vector. The descriptor consists of a set of attributes, such as the creation date, type, size, and annotations. The content vector consists of a set of terms extracted from the document. A semantic document management system, XBASE, is designed and implemented on the basis of these semantics and the functions of its three main modules: X-Loader, X-Explorer, and X-Query.
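The representation described in the abstract can be illustrated with a minimal Python sketch. It is not the XBASE implementation: the class `SemanticDocument`, the helpers `extract_terms` and `query`, the stopword list, and the sample attribute values are all hypothetical, chosen only to show a document addressed by a descriptor (attributes) and a content vector (extracted terms) rather than by a file name.

```python
from dataclasses import dataclass, field


@dataclass
class SemanticDocument:
    # Descriptor: attribute name -> value (e.g. type, size, annotations).
    descriptor: dict
    # Content vector: the set of terms extracted from the document text.
    content_vector: set = field(default_factory=set)


# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = frozenset({"a", "the", "of", "and", "in", "is", "on"})


def extract_terms(text):
    """Illustrative term extractor: lowercase, strip punctuation, drop stopwords."""
    words = (w.strip(".,;:!?()").lower() for w in text.split())
    return {w for w in words if w and w not in STOPWORDS}


def query(docs, attrs=None, terms=()):
    """Return documents whose descriptor matches every attribute in attrs
    and whose content vector contains every requested term."""
    attrs = attrs or {}
    wanted = {t.lower() for t in terms}
    return [
        d for d in docs
        if all(d.descriptor.get(k) == v for k, v in attrs.items())
        and wanted <= d.content_vector
    ]


# Made-up sample data; note that no file names appear anywhere.
docs = [
    SemanticDocument(
        {"type": "pdf", "size": 120_000, "annotations": "survey"},
        extract_terms("Managing large document collections"),
    ),
    SemanticDocument(
        {"type": "txt", "size": 2_000, "annotations": "notes"},
        extract_terms("Notes on query processing"),
    ),
]

hits = query(docs, attrs={"type": "pdf"}, terms=["document"])
```

Here a lookup combines a descriptor condition with a term condition, which is one plausible reading of how attribute-based and content-based access might be used together; the actual X-Loader, X-Explorer, and X-Query modules may work differently.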
Pages: 403-406
Number of pages: 4