Web directory construction using lexical chains

被引:0
|
作者
Stamou, S [1 ]
Krikos, V
Kokosis, P
Ntoulas, A
Christodoulakis, D
机构
[1] Univ Patras, Dept Comp Engn, Comp Technol Inst, GR-26500 Patras, Greece
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web Directories provide a way of locating relevant information on the Web. Typically, Web Directories rely on humans putting in significant time and effort into finding important pages on the Web and categorizing them in the Directory. In this paper we present a way for automating the creation of a Web Directory. At a high level, our method takes as input a subject hierarchy and a collection of pages. We first leverage a variety of lexical resources from the Natural Language Processing community to enrich our hierarchy. After that, we process the pages and identify sequences of important terms, which are referred to as lexical chains. Finally, we use the lexical chains in order to decide where in the enriched subject hierarchy we should assign every page. Our experimental results with real Web data show that our method is quite promising into assisting humans during page categorization.
引用
收藏
页码:138 / 149
页数:12
相关论文
共 50 条
  • [31] Extracting Local Web Communities Using Lexical Similarity
    Zhang, Xianchao
    Xu, Wen
    Liang, Wenxin
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2010, 6193 : 327 - 337
  • [32] A semantic approach for text clustering using WordNet and lexical chains
    Wei, Tingting
    Lu, Yonghe
    Chang, Huiyou
    Zhou, Qiang
    Bao, Xianyu
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (04) : 2264 - 2275
  • [33] Using lexical patterns for extracting hyponyms from the web
    Ortega-Mendoza, Rosa M.
    Villasenor-Pineda, Luis
    Montes-Y-Gomez, Manuel
    MICAI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4827 : 904 - +
  • [34] Lexical Chains Segmentation in Summarization
    Tatar, Doina
    Mihis, Andreea Diana
    Czibula, Gabriela Serban
    PROCEEDINGS OF THE 10TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, 2009, : 95 - 101
  • [35] Browse with a Social Web Directory
    Huang, Hao
    Gao, Yunjun
    Chen, Lu
    Li, Rui
    Chiew, Kevin
    He, Qinming
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 865 - 868
  • [36] Web directory a good idea
    Maciag, T
    COMMUNICATIONS NEWS, 1996, 33 (10): : 4 - 4
  • [37] TEXT SEGMENTATION USING ROGET-BASED WEIGHTED LEXICAL CHAINS
    Tatar, Doina
    Inkpen, Diana
    Czibula, Gabriela
    COMPUTING AND INFORMATICS, 2013, 32 (02) : 393 - 410
  • [38] A Lexical Pragmatic Account of Lexical Synonymy Construction
    Pang, Yang
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON LAW, LANGUAGE AND DISCOURSE: MULTICULTURALISM, MULTIMODALITY AND MULTIDIMENSIONALITY, 2012, : 26 - 32
  • [39] Web Service Discovery Using Lexical and Semantic Query Expansion
    Ma, Shang-Pin
    Li, Chia-Hsueh
    Tsai, Yao-Yu
    Lan, Ci-Wei
    2013 IEEE 10TH INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE), 2013, : 423 - 428
  • [40] Scalable Semantic Annotation of Text Using Lexical and Web Resources
    Zavitsanos, Elias
    Tsatsaronis, George
    Varlamis, Iraklis
    Paliouras, Georgios
    ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, PROCEEDINGS, 2010, 6040 : 287 - +