An Algorithm of Semi-structured Data Scheme Extraction Based on OEM Model

被引:0
|
作者
Gong, An [1 ]
Yang, Xue-wei [1 ]
机构
[1] China Univ Petr E China, Coll Comp & Commun Engn, Dongying 257061, Peoples R China
关键词
Semi-structured data; frequent patterns mining; OEM; the longest frequent label path;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to get the target model of semi-structured data rapidly, effectively and accurately, by combining the related nature of label path in the paper, this paper proposes an algorithm that can extract target model from the OEM model of semi-structured data directly. The basic idea of the Algorithm is: Using a Depth_First Search to get all of the label path expressions, with the help of the nature2 in this paper can reducing the number of path matching, we can generate all frequent label path expressions by layer. Finally, with the strategy of deletion we can get all of the longest frequent label path expressions effectively. Theoretical analysis and Experimental result shows that this algorithm can improve the accuracy of target model and reduce the size of candidate sets in pattern extraction.
引用
收藏
页码:315 / 319
页数:5
相关论文
共 50 条
  • [1] Schema Discovery of Semi-structured Hierarchical Data Based on OEM Model and Hierarchical Transactional Database
    Lv, Cheng
    Wei, Chu-yuan
    Hao, Ying
    ICMECG: 2009 INTERNATIONAL CONFERENCE ON MANAGEMENT OF E-COMMERCE AND E-GOVERNMENT, PROCEEDINGS, 2009, : 172 - 175
  • [2] Semi-structured data protection scheme based on robust watermarking
    Jiahuan He
    Qichao Ying
    Zhenxing Qian
    Guorui Feng
    Xinpeng Zhang
    EURASIP Journal on Image and Video Processing, 2020
  • [3] Semi-structured data protection scheme based on robust watermarking
    He, Jiahuan
    Ying, Qichao
    Qian, Zhenxing
    Feng, Guorui
    Zhang, Xinpeng
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [4] List data extraction in semi-structured document
    Xu, H
    Li, JZ
    Xu, P
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 584 - 585
  • [5] Knowledge extraction from semi-structured data based on fuzzy techniques
    Ceravolo, P
    Nocerino, MC
    Viviani, M
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2004, 3215 : 328 - 334
  • [6] Graph-based Retrieval Model for Semi-structured Data
    Park, Juneyoung
    Yi, Mun Y.
    2016 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2016, : 361 - 364
  • [7] Analyzing semi-structured data for ontological information extraction
    Han, H
    Elmasri, R
    IC'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, VOLS I AND II, 2001, : 21 - 27
  • [8] Semi-structured Data Extraction and Schema Knowledge Mining
    陈恩红
    High Technology Letters, 2001, (01) : 1 - 5
  • [9] Approximate graph schema extraction for semi-structured data
    Wang, QY
    Yu, JX
    Wong, KF
    ADVANCES IN DATABSE TECHNOLOGY-EDBT 2000, PROCEEDINGS, 2000, 1777 : 302 - 316
  • [10] SEMI-STRUCTURED DOCUMENT EXTRACTION BASED ON DOCUMENT ELEMENT BLOCK MODEL
    Lv, Tao
    Liu, Jiang
    Lu, Fan
    Zhang, Peng
    Wang, Xinyan
    Wang, Cong
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 461 - 465