On extracting link information of relationship instances from a web site

被引:0
|
作者
Naing, MM [1 ]
Lim, EP [1 ]
Goh, DHL [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
来源
WEB SERVICES -ICWS-EUROPE 2003, PROCEEDINGS | 2003年 / 2853卷
关键词
ontology; information extraction; hyperlink structure;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web pages from a web site can often be associated with concepts in an ontology, and pairs of web pages can also be associated with relationships between concepts. With such associations, web pages can be searched, browsed or even reorganized based on their concept and relationship labels. In this paper, we investigate the problem of extracting link information of relationship instances from a web site. We define the notion of link chain and formulate the link chain extraction problem. An extraction method based on sequential covering has been proposed to solve the problem. This paper presents the proposed method and the experiments to evaluate its performance. We have applied the method to extract link chain information from the Yahoo! Movie Web Site with very promising results.
引用
收藏
页码:213 / 226
页数:14
相关论文
共 50 条
  • [31] Geographic information on the web: Extracting demographic and market research information
    Linberger, P
    White, GW
    19TH ANNUAL NATIONAL ONLINE MEETING, PROCEEDINGS, 1998, : 235 - 242
  • [32] Extended link analysis for extracting spatial information hubs
    Zhang, J
    Ishikawa, Y
    Kitagawa, H
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 17 - 22
  • [33] Extracting Structure of Web Site Based on Hyperlink Analysis
    Li, Feng
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 10919 - 10922
  • [34] Extended link analysis for extracting spatial information hubs
    Zhang, J. (zjw@kde.cs.tsukuba.ac.jp), IEEE Computer Society; Database Society of Japan, DBSJ; Information Processing Society of Japan, IPSJ; Institute of Electronics, Info. and Com. Eng., IEICE (Institute of Electrical and Electronics Engineers Computer Society):
  • [35] Beyond supervised learning of wrappers for extracting information from unseen Web sites
    Wong, TL
    Lam, W
    Wang, W
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 725 - 733
  • [36] Extracting information from the web for concept learning and collaborative filtering - (Extended abstract)
    Cohen, WW
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2000, 1968 : 1 - 12
  • [37] The Technology of Extracting Content Information from Web Page Based on DOM Tree
    Yuan, Dingrong
    Mo, Zhuoying
    Xie, Bing
    Xie, Yangcai
    ADVANCED RESEARCH ON ELECTRONIC COMMERCE, WEB APPLICATION, AND COMMUNICATION, PT 2, 2011, 144 : 271 - 278
  • [38] A Framework for Extracting Information from Semi-Structured Web Data Sources
    Shaker, Malunoud
    Ibrahim, Hamidah
    Mustapha, Aida
    Abdullah, Lili Nurliyana
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 27 - 31
  • [39] A novel method for extracting information from web pages with multiple presentation templates
    Qingzhong L.
    Yanhui D.
    An F.
    Yongquan D.
    Journal of Software, 2010, 5 (05) : 506 - 513
  • [40] Extracting Spatio-Temporal Information from Chinese Archaeological Site Text
    Yuan, Wenjing
    Yang, Lin
    Yang, Qing
    Sheng, Yehua
    Wang, Ziyang
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (03)