Managing very large document collections using semantics

被引:1
|
作者
Wang, GR [1 ]
Lu, HJ
Yu, G
Bao, YB
机构
[1] Northeastern Univ, Dept Comp Sci, Shenyang 110004, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
关键词
semantic document; multidimensional exploring; document querying;
D O I
10.1007/BF02948912
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a system is presented where documents are no longer identified by their file names. Instead, a document is represented by its semantics in terms of descriptor and content vector. The descriptor of a document consists of a set of attributes, such as date of creation, its type, its size, annotations, etc. The content vector of a document consists of a set of terms extracted from the document. In this paper, a semantic document management system XBASE is designed and implemented based on the semantics and the functions of three main modules, X-Loader, X-Explorer and X-Query.
引用
收藏
页码:403 / 406
页数:4
相关论文
共 50 条
  • [31] Lightweight LCP construction for very large collections of strings
    Cox, Anthony J.
    Garofalo, Fabio
    Rosone, Giovanna
    Sciortino, Marinella
    JOURNAL OF DISCRETE ALGORITHMS, 2016, 37 : 17 - 33
  • [32] Graph Navigation for Exploring Very Large Image Collections
    Barthel, Kai Uwe
    Hezel, Nico
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, : 411 - 416
  • [33] Computing the optimal BWT of very large string collections
    Cenzato, Davide
    Guerrini, Veronica
    Liptak, Zsuzsanna
    Rosone, Giovanna
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 71 - 80
  • [34] A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections
    Pappas, Dimitris
    Androutsopoulos, Ion
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3896 - 3907
  • [35] Using webspaces to model document collections on the web
    Van Zwol, R
    Apers, PMG
    CONCEPTUAL MODELING FOR E-BUSINESS AND THE WEB, PROCEEDINGS, 2000, 1921 : 101 - 114
  • [36] Between a Rock and a Hard Place: Managing Government Document Collections in a Digital World
    Sowell, Steven L.
    Boock, Michael H.
    Landis, Lawrence A.
    Nutefall, Jennifer E.
    COLLECTION MANAGEMENT, 2012, 37 (02) : 98 - 109
  • [37] Using webspaces to model document collections on the web
    Van Zwol, Roelof
    Apers, Peter M.G.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2000, 1921 : 101 - 114
  • [38] A fast text similarity measure for large document collections using multireference cosine and genetic algorithm
    Mohammadi, Hamid
    Khasteh, Seyed Hossein
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2020, 28 (02) : 999 - 1013
  • [39] A Scalable Model for Tracking Topical Evolution in Large Document Collections
    Naim, Sheikh Motahar
    Boedihardjo, Arnold P.
    Hossain, M. Shahriar
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 726 - 735
  • [40] Figure search by text in large scale digital document collections
    Yurtsever, M. Mucahit Enes
    Ozcan, Muhammet
    Taruz, Zubeyir
    Eken, Suleyman
    Sayar, Ahmet
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01):