Managing very large document collections using semantics

被引:1
|
作者
Wang, GR [1 ]
Lu, HJ
Yu, G
Bao, YB
机构
[1] Northeastern Univ, Dept Comp Sci, Shenyang 110004, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
关键词
semantic document; multidimensional exploring; document querying;
D O I
10.1007/BF02948912
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a system is presented where documents are no longer identified by their file names. Instead, a document is represented by its semantics in terms of descriptor and content vector. The descriptor of a document consists of a set of attributes, such as date of creation, its type, its size, annotations, etc. The content vector of a document consists of a set of terms extracted from the document. In this paper, a semantic document management system XBASE is designed and implemented based on the semantics and the functions of three main modules, X-Loader, X-Explorer and X-Query.
引用
收藏
页码:403 / 406
页数:4
相关论文
共 50 条
  • [41] RANKING LARGE DOCUMENT COLLECTIONS BY A STATE-SPACE SEARCH
    GORDON, MD
    INFORMATION PROCESSING & MANAGEMENT, 1991, 27 (01) : 27 - 41
  • [42] Detecting short passages of similar text in large document collections
    Lyon, C
    Malcolm, J
    Dickerson, B
    PROCEEDINGS OF THE 2001 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2001, : 118 - 125
  • [43] Possibilistic fuzzy co-clustering of large document collections
    Tjhi, William-Chandra
    Chen, Lihui
    PATTERN RECOGNITION, 2007, 40 (12) : 3452 - 3466
  • [44] Obtaining Technology Insights from Large and Heterogeneous Document Collections
    Dey, Lipika
    Mahajan, Diwakar
    Gupta, Hemant
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2014, : 102 - 109
  • [45] Figure search by text in large scale digital document collections
    Yurtsever, M. Mücahit Enes
    Özcan, Muhammet
    Taruz, Zübeyir
    Eken, Süleyman
    Sayar, Ahmet
    Concurrency and Computation: Practice and Experience, 2022, 34 (01)
  • [46] Exploration of large document collections by self-organizing maps
    Kohonen, T
    SIXTH SCANDINAVIAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1997, 40 : 5 - 7
  • [47] ProbMap-A probabilistic approach for mapping large document collections
    Hofmann, Thomas
    Intelligent Data Analysis, 2000, 4 (02) : 149 - 164
  • [48] Managing Very-Large Distributed Datasets
    Branco, Miguel
    Zaluska, Ed
    de Roure, David
    Salgado, Pedro
    Garonne, Vincent
    Lassnig, Mario
    Rocha, Ricardo
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2008, PART I, 2008, 5331 : 775 - +
  • [49] Entropy-based authorship search in large document collections
    Zhao, Ying
    Zobel, Justin
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 381 - +
  • [50] Managing collections
    Chauwin, Ludovic
    IN SITU-REVUE DE PATRIMOINES, 2016, (30):