Intelligent integration of information from semi-structured web data sources on the basis of ontology and meta-models

被引:3
|
作者
Arnicans, Guntis [1 ]
Karnitis, Girts [1 ]
机构
[1] Univ Latvia, Riga, Latvia
关键词
D O I
10.1109/DBIS.2006.1678494
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As computer users face an increasing amount of various semi-structured information sources, the issue of correlating, integration and presenting related information to users becomes all the more important. As a solution, we propose the Semi-Structured Data Universal Data Browser, which, in its operations, makes use of descriptions of data sources that are presented in the form of meta-models or ontologies, ensuring the user's ability to use information from the data sources. Information from semi-structured data sources is analyzed, transformed and stored in the Semi-Structured Data Universal Data Browser database, which is based on meta-models. The ontologies of information in each data source are preserved, and they are mutually linked to logical and global ontologies through the use of mapping. As an example, we use the integration of information on Internet homepages about products and their classifications.
引用
收藏
页码:177 / +
页数:2
相关论文
共 35 条
  • [31] Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data
    Zhou, Shuyan
    Zhou, Li
    Yang, Yue
    Lyu, Qing
    Yin, Pengcheng
    Callison-Burch, Chris
    Neubig, Graham
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2998 - 3012
  • [32] Ontology-based information extraction and integration from heterogeneous data sources
    Buitelaar, Paul
    Cimiano, Philipp
    Frank, Anette
    Hartung, Matthias
    Racloppa, Stefania
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2008, 66 (11) : 759 - 788
  • [33] Self-Training for Label-Efficient Information Extraction from Semi-Structured Web-Pages
    Sarkhel, Ritesh
    Huang, Binxuan
    Lockard, Cohn
    Shiralkar, Prashant
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (11): : 3098 - 3110
  • [34] Information Extraction from Semi-Structured WEB Page Based on DOM Tree and Its Application in Scientific Literature Statistical Analysis System
    Li WeiDong
    Dong Yibing
    Wang RuiJiang
    Tian HongXia
    2009 IITA INTERNATIONAL CONFERENCE ON SERVICES SCIENCE, MANAGEMENT AND ENGINEERING, PROCEEDINGS, 2009, : 124 - +
  • [35] Large language models for data extraction from unstructured and semi-structured electronic health records: a multiple model performance evaluation
    Ntinopoulos, Vasileios
    Biefer, Hector Rodriguez Cetina
    Tudorache, Igor
    Papadopoulos, Nestoras
    Odavic, Dragan
    Risteski, Petar
    Haeussler, Achim
    Dzemali, Omer
    BMJ HEALTH & CARE INFORMATICS, 2025, 32 (01)