Intelligent integration of information from semi-structured web data sources on the basis of ontology and meta-models

Cited by: 3
Authors
Arnicans, Guntis [1 ]
Karnitis, Girts [1 ]
Affiliations
[1] Univ Latvia, Riga, Latvia
DOI
10.1109/DBIS.2006.1678494
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As computer users face a growing number of semi-structured information sources, correlating, integrating, and presenting related information to them becomes increasingly important. As a solution, we propose the Semi-Structured Data Universal Data Browser, which relies on descriptions of data sources expressed as meta-models or ontologies so that users can work with the information those sources contain. Information from semi-structured data sources is analyzed, transformed, and stored in the Semi-Structured Data Universal Data Browser database, which is itself based on meta-models. The ontology of each data source is preserved and linked to logical and global ontologies through mappings. As an example, we integrate information about products and their classifications published on Internet homepages.
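The abstract describes keeping an ontology for each source and linking it to a global ontology through mappings. A minimal Python sketch of that idea follows. It is not the authors' implementation: the `SourceOntology` class, the `to_global` function, the shop names, and all field and concept names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class SourceOntology:
    """Hypothetical description of one data source.

    `mapping` links each local field name to a concept in an assumed
    global ontology.
    """
    name: str
    mapping: dict[str, str]


def to_global(record: dict[str, str], source: SourceOntology) -> dict[str, str]:
    """Rename a record's local fields to global concepts.

    The source name is kept under "_source" so the origin of each record
    stays visible; fields without a mapping are skipped.
    """
    out = {"_source": source.name}
    for local_field, value in record.items():
        concept = source.mapping.get(local_field)
        if concept is not None:
            out[concept] = value
    return out


# Two invented product pages that describe the same product differently.
shop_a = SourceOntology("shop_a", {"title": "product_name", "cat": "category"})
shop_b = SourceOntology(
    "shop_b", {"item": "product_name", "group": "category", "cost": "price"}
)

merged = [
    to_global({"title": "Laptop X", "cat": "Computers"}, shop_a),
    to_global({"item": "Laptop X", "group": "Notebooks", "cost": "999"}, shop_b),
]
```

After the mapping step both records share the `product_name` and `category` keys, so they can be compared or presented side by side while each still records which source it came from.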
Pages: 177 / +
Page count: 2
Related papers
35 in total
  • [21] Extracting lists of data records from semi-structured web pages
    Alvarez, Manuel
    Pan, Alberto
    Raposo, Juan
    Bellas, Fernando
    Cacheda, Fidel
    DATA & KNOWLEDGE ENGINEERING, 2008, 64 (02) : 491 - 509
  • [22] Automatic information extraction from semi-structured Web pages by pattern discovery
    Chang, CH
    Hsu, CN
    Lui, SC
    DECISION SUPPORT SYSTEMS, 2003, 35 (01) : 129 - 147
  • [23] Interoperability and semi-structured data in an open web-based agent information system
    Lu, HG
    Sterling, L
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 80 - 86
  • [24] Recognition of Data Records in Semi-structured Web-Pages Using Ontology and χ2 Statistical Distribution
    Keshavarzi, Amin
    Rahmani, Amir Masoud
    Mohsenzadeh, Mehran
    Keshavarzi, Reza
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 675 - +
  • [25] Automated Extraction of Concept Matcher Thesaurus from Semi-Structured Catalogue-Like Sources of Data on the Web
    Lapaev, Maxim
    2016 18TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION AND SEMINAR ON INFORMATION SECURITY AND PROTECTION OF INFORMATION TECHNOLOGY (FRUCT-ISPIT), 2016, : 153 - 160
  • [26] RETRACTED: Extracting Information from Semi-structured Web Documents: A Framework (Retracted Article)
    Memon, Nasrullah
    Qureshi, Abdul Rasool
    Hicks, David
    Harkiolakis, Nicholas
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, 2008, 4977 : 54 - +
  • [27] Generating finite-state transducers for semi-structured data extraction from the Web
    Academia Sinica, Taipei, Taiwan
    Inf Syst, 8 (521-538):
  • [28] Generating finite-state transducers for semi-structured data extraction from the Web
    Hsu, CN
    Dung, MT
    INFORMATION SYSTEMS, 1998, 23 (08) : 521 - 538
  • [29] Information extraction from semi-structured data in the protein data bank by induction of a data description pattern
    Kawaguchi, Y
    Kaneta, Y
    Ohkawa, T
    Nakamura, H
    Ito, N
    METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2003, : 94 - 99
  • [30] A Lightweight Approach to Extract Interschema Properties from Structured, Semi-Structured and Unstructured Sources in a Big Data Scenario
    Cauteruccio, Francesco
    Lo Giudice, Paolo
    Musarella, Lorenzo
    Terracina, Giorgio
    Ursino, Domenico
    Virgili, Luca
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2020, 19 (03) : 849 - 889