An Algorithm of Semi-structured Data Scheme Extraction Based on OEM Model

被引:0
|
作者
Gong, An [1 ]
Yang, Xue-wei [1 ]
机构
[1] China Univ Petr E China, Coll Comp & Commun Engn, Dongying 257061, Peoples R China
关键词
Semi-structured data; frequent patterns mining; OEM; the longest frequent label path;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to get the target model of semi-structured data rapidly, effectively and accurately, by combining the related nature of label path in the paper, this paper proposes an algorithm that can extract target model from the OEM model of semi-structured data directly. The basic idea of the Algorithm is: Using a Depth_First Search to get all of the label path expressions, with the help of the nature2 in this paper can reducing the number of path matching, we can generate all frequent label path expressions by layer. Finally, with the strategy of deletion we can get all of the longest frequent label path expressions effectively. Theoretical analysis and Experimental result shows that this algorithm can improve the accuracy of target model and reduce the size of candidate sets in pattern extraction.
引用
收藏
页码:315 / 319
页数:5
相关论文
共 50 条
  • [41] Low-Dimensionality Information Extraction Model for Semi-structured Documents
    Belhadj, Djedjiga
    Belaïd, Abdel
    Belaïd, Yolande
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14184 LNCS : 76 - 85
  • [42] Adaptive retrieval of semi-structured data
    Ben-Asher, Yosi
    Berkovsky, Shlomo
    Busetta, Paolo
    Eytani, Yaniv
    Jbara, Sadek
    Kuflik, Tsvi
    ADAPTIVE HYPERMEDIA AND ADAPTIVE WEB-BASED SYSTEMS, 2008, 5149 : 32 - +
  • [43] A Survey on the Semi-Structured Data Models
    Chakraborty, Supriya
    Chaki, Nabendu
    COMPUTER INFORMATION SYSTEMS - ANALYSIS AND TECHNOLOGIES, 2011, 245 : 257 - +
  • [44] Specification and Verification for Semi-Structured Data
    CHEN Tao-lue
    WuhanUniversityJournalofNaturalSciences, 2006, (01) : 107 - 112
  • [45] Generic organization of semi-structured data
    Chakraborty, Supriya
    Chaki, Nabendu
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2014, 29 (01): : 65 - 74
  • [46] Schema based data storage and query optimization for semi-structured data
    Wang, QK
    Zhou, LZ
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 389 - 398
  • [47] Data Integration Approach for Semi-structured and Structured Data (Linked Data)
    Kettouch, Mohamed Salah
    Luca, Cristina
    Hobbs, Mike
    Fatima, Arooj
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2015, : 820 - 825
  • [48] An automated integration approach for semi-structured and structured data
    Lim, SJ
    Ng, YK
    PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON COOPERATIVE DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2000, : 12 - 21
  • [49] Schemas for integration and translation of structured and semi-structured data
    Beeri, C
    Milo, T
    DATABASE THEORY - ICDT'99, 1999, 1540 : 296 - 313
  • [50] Research on new product structure model based on semi-structured data for virtual enterprise
    Li, XY
    Dong, Z
    Guo, AD
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 838 - 842