Integrating HTML']HTML tables using semantic hierarchies and meta-data sets

被引:2
|
作者
Lim, SJ [1 ]
Ng, YK [1 ]
Yang, XC [1 ]
机构
[1] Brigham Young Univ, Dept Comp Sci, Provo, UT 84602 USA
来源
IDEAS 2002: INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS | 2002年
关键词
D O I
10.1109/IDEAS.2002.1029668
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the Internet is a global network, there is a demand on accessing closely related data without browsing through different Web documents. A significant amount of these data are presented in HTML documents. Since data contents of HTML documents are intervened by markups, it is not trivial to integrate and provide a unified view of closely related data in different HTML documents. In this paper we present an approach for integrating semantically related data in any HTML tables that belong to a particular domain of interest (ID), such as house/apartment rental, by using the semantic hierarchies generated from the tables and the predefined meta-data sets that indicate related column names in ID. In our approach, we capture each data source as semi-structured data, called semantic hierarchy, and the end result of integrating different HTML tables of ID is a unified view of data in the tables, which is presented in an XML document. Besides HTML tables, our approach can be adopted by any system that integrates semi-structured data across different platforms.
引用
收藏
页码:160 / 169
页数:10
相关论文
共 50 条
  • [31] XML and MPEG-7 for interactive annotation and retrieval using semantic meta-data
    Lux, M
    Klieber, W
    Becker, J
    Tochtermann, K
    Mayer, H
    Neuschmied, H
    Haas, W
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2002, 8 (10): : 965 - 984
  • [32] Concept Focus: Semantic Meta-Data for Describing MOOC Content
    Mesbah, Sepideh
    Chen, Guanliang
    Torre, Manuel Valle
    Bozzon, Alessandro
    Lofi, Christoph
    Houben, Geert-Jan
    LIFELONG TECHNOLOGY-ENHANCED LEARNING, EC-TEL 2018, 2018, 11082 : 467 - 481
  • [33] Data Resource Semantic Support Method Research based on Meta-data Annotation
    Xu, Ke
    Cai, Hongming
    2016 WORLD AUTOMATION CONGRESS (WAC), 2016,
  • [34] LegalHTML']HTML: Semantic mark-up of legal acts using web technologies
    Stellato, Armando
    Fiorelli, Manuel
    COMPUTER LAW & SECURITY REVIEW, 2023, 51
  • [35] Semantic heterogeneity in multidatabase systems: A review and a proposed meta-data structure
    Wang, TW
    Murphy, KE
    JOURNAL OF DATABASE MANAGEMENT, 2004, 15 (04) : 71 - 87
  • [36] Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval
    Venkataramanan, Aishwarya
    Laviale, Martin
    Pradalier, Cedric
    COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 422 - 431
  • [37] The "Squeezer": an HTML']HTML programme designed to estimate relative insulin sensitivity and relative beta cell function using OGTT data PH Contreras;
    Contreras, P. H.
    DIABETOLOGIA, 2020, 63 (SUPPL 1) : S163 - S163
  • [38] Extraction of Meta-Data for Recommendation Using Keyword Mapping
    Kim, Geon-Woo
    Kim, Woo-Hyeon
    Chung, Kyungyong
    Kim, Joo-Chang
    IEEE ACCESS, 2024, 12 : 103647 - 103659
  • [39] Pigeon-Chart: A Customized HTML']HTML Element for Data Visualization in Data-Driven Web Application Using AngularJS']JS, HighCharts, UnderscoreJS']JS and PHP
    Hua, Eva Cheong Chiek
    Nen, Voon Yang
    Tee, Fu Swee
    Ann, Ong Chin
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION SYSTEMS (ICCIS), 2018, : 247 - 252
  • [40] Using visual cues for extraction of tabular data from arbitrary HTML documents
    Krüpl, B. (kruepl@dbai.tuwien.ac.at), 1600, et al.; Fuji Xerox Co., Ltd.; Hitachi, Ltd.; NEC; World Wide Web Consortium (W3C); Yahoo (Association for Computing Machinery (ACM)):