On extracting link information of relationship instances from a web site

被引：0

作者：

Naing, MM ^{[1
]}

Lim, EP ^{[1
]}

Goh, DHL ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore

来源：

WEB SERVICES -ICWS-EUROPE 2003, PROCEEDINGS | 2003年 / 2853卷

关键词：

ontology; information extraction; hyperlink structure;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Web pages from a web site can often be associated with concepts in an ontology, and pairs of web pages can also be associated with relationships between concepts. With such associations, web pages can be searched, browsed or even reorganized based on their concept and relationship labels. In this paper, we investigate the problem of extracting link information of relationship instances from a web site. We define the notion of link chain and formulate the link chain extraction problem. An extraction method based on sequential covering has been proposed to solve the problem. This paper presents the proposed method and the experiments to evaluate its performance. We have applied the method to extract link chain information from the Yahoo! Movie Web Site with very promising results.

引用

页码：213 / 226

页数：14

共 50 条

[31] Geographic information on the web: Extracting demographic and market research information
Linberger, P
White, GW
19TH ANNUAL NATIONAL ONLINE MEETING, PROCEEDINGS, 1998, : 235 - 242
[32] Extended link analysis for extracting spatial information hubs
Zhang, J
Ishikawa, Y
Kitagawa, H
INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 17 - 22
[33] Extracting Structure of Web Site Based on Hyperlink Analysis
Li, Feng
2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 10919 - 10922
[34] Extended link analysis for extracting spatial information hubs
Zhang, J. (zjw@kde.cs.tsukuba.ac.jp), IEEE Computer Society; Database Society of Japan, DBSJ; Information Processing Society of Japan, IPSJ; Institute of Electronics, Info. and Com. Eng., IEICE (Institute of Electrical and Electronics Engineers Computer Society):
[35] Beyond supervised learning of wrappers for extracting information from unseen Web sites
Wong, TL
Lam, W
Wang, W
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 725 - 733
[36] Extracting information from the web for concept learning and collaborative filtering - (Extended abstract)
Cohen, WW
ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2000, 1968 : 1 - 12
[37] The Technology of Extracting Content Information from Web Page Based on DOM Tree
Yuan, Dingrong
Mo, Zhuoying
Xie, Bing
Xie, Yangcai
ADVANCED RESEARCH ON ELECTRONIC COMMERCE, WEB APPLICATION, AND COMMUNICATION, PT 2, 2011, 144 : 271 - 278
[38] A Framework for Extracting Information from Semi-Structured Web Data Sources
Shaker, Malunoud
Ibrahim, Hamidah
Mustapha, Aida
Abdullah, Lili Nurliyana
THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 27 - 31
[39] A novel method for extracting information from web pages with multiple presentation templates
Qingzhong L.
Yanhui D.
An F.
Yongquan D.
Journal of Software, 2010, 5 (05) : 506 - 513
[40] Extracting Spatio-Temporal Information from Chinese Archaeological Site Text
Yuan, Wenjing
Yang, Lin
Yang, Qing
Sheng, Yehua
Wang, Ziyang
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (03)

← 1 2 3 4 5 →