An automatic classification technique and tool for information retrieval of web documents

Cited by: 0
Authors
Di Martino, B [1]
Mazzocca, N [1]
Squeglia, A [1]
Mazzeo, A [1]
Affiliations
[1] Univ Naples 2, Dipartimento Ingn Informaz, Naples, Italy
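Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract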
In this paper we describe a technique, and its prototype implementation, for information retrieval over text documents and Web pages and sites. The technique is based on automatic classification, specifically on an unsupervised hierarchical clustering method. It builds a hierarchical classification tree from the set of analyzed documents and thereby classifies them according to paradigmatic similarity relationships. The technique has been implemented in a prototype tool written entirely in Java. The prototype gives the user information retrieval functionality through a graphical interface based on a search document map. The main functionalities of the tool are described in this paper.
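As an illustration of the kind of unsupervised hierarchical clustering the abstract refers to, here is a minimal sketch in Java. It is not the authors' implementation: the abstract does not specify the document representation, the similarity measure, or the linkage rule, so the term-count vectors, cosine similarity, average linkage, and all class, method, and document names below are assumptions chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: bottom-up (agglomerative) clustering of documents into a binary tree.
public class HierarchicalClusteringSketch {

    /** Tree node: a leaf holds one document, an inner node joins two subtrees. */
    static final class Node {
        final String label;                                // document name for leaves, null otherwise
        final Node left, right;
        final List<double[]> vectors = new ArrayList<>();  // vectors of all documents under this node

        Node(String label, double[] vector) {
            this.label = label;
            this.left = null;
            this.right = null;
            vectors.add(vector);
        }

        Node(Node left, Node right) {
            this.label = null;
            this.left = left;
            this.right = right;
            vectors.addAll(left.vectors);
            vectors.addAll(right.vectors);
        }

        @Override
        public String toString() {
            return label != null ? label : "(" + left + ", " + right + ")";
        }
    }

    /** Cosine similarity between two term-count vectors of equal length (assumed measure). */
    static double cosine(double[] a, double[] b) {
        double dot = 0, na = 0, nb = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            na += a[i] * a[i];
            nb += b[i] * b[i];
        }
        return (na == 0 || nb == 0) ? 0 : dot / Math.sqrt(na * nb);
    }

    /** Average-linkage similarity between two clusters (assumed linkage rule). */
    static double similarity(Node x, Node y) {
        double sum = 0;
        for (double[] a : x.vectors) {
            for (double[] b : y.vectors) {
                sum += cosine(a, b);
            }
        }
        return sum / (x.vectors.size() * y.vectors.size());
    }

    /** Repeatedly merges the two most similar clusters; expects at least one leaf. */
    static Node cluster(List<Node> leaves) {
        List<Node> active = new ArrayList<>(leaves);
        while (active.size() > 1) {
            int bi = 0, bj = 1;
            double best = Double.NEGATIVE_INFINITY;
            for (int i = 0; i < active.size(); i++) {
                for (int j = i + 1; j < active.size(); j++) {
                    double s = similarity(active.get(i), active.get(j));
                    if (s > best) {
                        best = s;
                        bi = i;
                        bj = j;
                    }
                }
            }
            Node merged = new Node(active.get(bi), active.get(bj));
            active.remove(bj); // remove the higher index first so bi stays valid
            active.remove(bi);
            active.add(merged);
        }
        return active.get(0);
    }

    public static void main(String[] args) {
        // Made-up documents with made-up term counts, for illustration only.
        List<Node> docs = new ArrayList<>();
        docs.add(new Node("java-tutorial", new double[] {3, 1, 0, 0}));
        docs.add(new Node("java-reference", new double[] {2, 2, 0, 0}));
        docs.add(new Node("pasta-recipes", new double[] {0, 0, 3, 1}));
        docs.add(new Node("pizza-recipes", new double[] {0, 0, 1, 3}));
        System.out.println(cluster(docs));
    }
}
```

In this sketch the root of the returned tree covers all documents, and each subtree groups documents that are more similar to one another than to the rest, which is the sense in which a clustering tree can serve as a classification. A different similarity measure or linkage rule would generally produce a different tree.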
Pages: 1043-1050
Page count: 8
Related papers
50 in total
  • [1] INTERPRETATION OF AUTOMATIC CLASSIFICATION IN INFORMATION-RETRIEVAL (ROUGH SEARCH OF DOCUMENTS)
    PANYR, J
    INTERNATIONAL CLASSIFICATION, 1982, 9 (01): : 11 - 18
  • [2] Ontology based Fuzzy Classification of Web Documents for Semantic Information Retrieval
    Joshi, Kajal
    Verma, Ashish
    Kandpal, Ankita
    Garg, Shalini
    Chauhan, Rashmi
    Goudar, R. H.
    2013 SIXTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2013, : 1 - 5
  • [3] Individualized Automatic Classification of Web Documents
    Tsai, Yihjia
    Chen, Kaun-Yu
    PROCEEDINGS OF 2010 CROSS-STRAIT CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY, 2010, : 410 - 412
  • [4] Intelligent support for information retrieval of web documents
    Koval, R
    Návrat, P
    COMPUTING AND INFORMATICS, 2002, 21 (05) : 509 - 528
  • [5] Semantic Proximity in Information Retrieval and Documents Classification
    Vishnyakov, Yury
    Vishnyakov, Renat
    14TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2013, : 131 - 134
  • [6] AUTOMATIC CLASSIFICATION IN INFORMATION-RETRIEVAL
    VANRIJSBERGEN, CJ
    DREXEL LIBRARY QUARTERLY, 1978, 14 (02): : 75 - 89
  • [7] Multilingual and multimedia Information Retrieval from Web documents
    Gatius, M
    Bertran, M
    Rodriguez, H
    15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 20 - 24
  • [8] Ontology-based automatic classification of web documents
    Song, MuHee
    Lim, SooYeon
    Kang, DongJin
    Lee, SangJo
    COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 690 - 700
  • [9] Automatic Web site classification in a large repository under information filtering and retrieval techniques
    Anagnostopoulos, I
    Kouzas, G
    Anagnostopoulos, C
    Vergados, D
    Papaleonidopoulos, I
    Generalis, A
    Loumos, V
    Kayafas, E
    11TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, PROCEEDINGS, 2002, : 279 - 283
  • [10] A technique towards automatic audio classification and retrieval
    Lu, GJ
    Hankinson, T
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 1142 - 1145