Web directory construction using lexical chains

被引:0
|
作者
Stamou, S [1 ]
Krikos, V
Kokosis, P
Ntoulas, A
Christodoulakis, D
机构
[1] Univ Patras, Dept Comp Engn, Comp Technol Inst, GR-26500 Patras, Greece
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web Directories provide a way of locating relevant information on the Web. Typically, Web Directories rely on humans putting in significant time and effort into finding important pages on the Web and categorizing them in the Directory. In this paper we present a way for automating the creation of a Web Directory. At a high level, our method takes as input a subject hierarchy and a collection of pages. We first leverage a variety of lexical resources from the Natural Language Processing community to enrich our hierarchy. After that, we process the pages and identify sequences of important terms, which are referred to as lexical chains. Finally, we use the lexical chains in order to decide where in the enriched subject hierarchy we should assign every page. Our experimental results with real Web data show that our method is quite promising into assisting humans during page categorization.
引用
收藏
页码:138 / 149
页数:12
相关论文
共 50 条
  • [1] Automatic construction of web directory using hyperlink and anchor text
    Suzuki, Y
    Matsubara, S
    Yoshikawa, M
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 614 - 619
  • [2] Not as easy as it seems:: Automating the construction of lexical chains using Roget's Thesaurus
    Jarmasz, M
    Szpakowicz, S
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 544 - 549
  • [3] Using lexical chains for keyword extraction
    Ercan, Gonenc
    Cicekli, Ilyas
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) : 1705 - 1714
  • [4] Keyword extraction based on lexical chains for Chinese news web pages
    Hu, Xue-Gang
    Li, Xing-Hua
    Xie, Fei
    Wu, Xin-Dong
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2010, 23 (01): : 45 - 51
  • [5] Extractive Summarization of a Document Using Lexical Chains
    Mallick, Chirantana
    Dutta, Madhurima
    Das, Ajit Kumar
    Sarkar, Apurba
    Das, Asit Kumar
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 825 - 836
  • [6] Web directory
    Electronic Packaging and Production, 1999, 39 (13):
  • [7] Web directory
    Electronic Packaging and Production, 2000, 40 (05):
  • [8] Web directory
    Lasers & Optronics, 1999, 18 (04):
  • [9] Web directory
    Mining Engineering (Littleton, Colorado), 2000, 52 (11):
  • [10] Web directory
    Lasers & Optronics, 1998, 17 (04):