Similarity measurement of XML documents based on structure and contents

Cited by: 0
Authors
Kim, Tae-Soon [1 ]
Lee, Ju-Hong [1 ]
Song, Jae-Won [1 ]
Kim, Deok-Hwan [2 ]
Affiliations
[1] Inha Univ, Dept Comp Sci & Informat Engn, Incheon, South Korea
[2] Inha Univ, Dept Elect Engn, Incheon, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
Research on similarity measures between XML documents is under way in order to manage and retrieve diverse XML documents effectively. Previous work mostly proposes similarity measures that focus only on the tag structure of XML documents. However, such methods miscalculate the semantic similarity of XML content. In this paper, we propose a new similarity measurement method that considers not only the structural information of tags in XML documents but also the semantic information of the tags and the text content associated with them. Our experiments demonstrate that the proposed method improves the accuracy of similarity measurement compared with previous work.
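The general idea of combining a structural score with a content score can be illustrated with a minimal sketch. This is not the authors' method, which the abstract does not specify. It assumes, purely for illustration, Jaccard similarity over root-to-node tag paths for structure, cosine similarity over whitespace-split text tokens for content, and a hypothetical weight `alpha` to blend the two. The semantic similarity of tags mentioned in the abstract (for example, treating synonymous tag names as related) is not modeled here.

```python
import math
import xml.etree.ElementTree as ET
from collections import Counter


def tag_paths(root):
    """Collect the set of root-to-node tag paths, e.g. 'book/title'."""
    paths = set()

    def walk(node, prefix):
        path = f"{prefix}/{node.tag}" if prefix else node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def text_tokens(root):
    """Count lowercase whitespace-separated tokens in all text nodes."""
    counts = Counter()
    for text in root.itertext():
        counts.update(text.lower().split())
    return counts


def jaccard(a, b):
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cosine(a, b):
    """Cosine similarity of two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def xml_similarity(xml_a, xml_b, alpha=0.5):
    """Blend structural and content similarity.

    alpha is an illustrative weight (an assumption, not from the paper):
    1.0 uses structure only, 0.0 uses text content only.
    """
    root_a = ET.fromstring(xml_a)
    root_b = ET.fromstring(xml_b)
    structural = jaccard(tag_paths(root_a), tag_paths(root_b))
    content = cosine(text_tokens(root_a), text_tokens(root_b))
    return alpha * structural + (1 - alpha) * content
```

With this sketch, two documents that share a tag structure but have unrelated text receive only the structural share of the score, which is the kind of case a structure-only measure would rate as fully similar.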
Pages: 902 / +
Page count: 2
Related papers
50 in total
  • [21] A Bloom Filter Based Approach for Evaluating Structural Similarity of XML Documents
    Peng, Dunlu
    Hou, Huan
    Lu, Jing
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 242 - 251
  • [22] Structural similarity between XML documents and DTDs
    Ng, PKL
    Ng, VTY
    COMPUTATIONAL SCIENCE - ICCS 2003, PT III, PROCEEDINGS, 2003, 2659 : 412 - 421
  • [23] Using structural similarity for clustering XML documents
    Aitelhadj, Ali
    Boughanem, Mohand
    Mezghiche, Mohamed
    Souam, Fatiha
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) : 109 - 139
  • [24] Using structural similarity for clustering XML documents
    Ali Aïtelhadj
    Mohand Boughanem
    Mohamed Mezghiche
    Fatiha Souam
    Knowledge and Information Systems, 2012, 32 : 109 - 139
  • [25] Semantic Structural Similarity for Clustering XML Documents
    Kim, Tae-Soon
    Lee, Ju-Hong
    Song, Jae-Won
    ICHIT 2008: INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 552 - 557
  • [26] Content and structure based approach for XML similarity
    Ma, YH
    Chbeir, R
    Fifth International Conference on Computer and Information Technology - Proceedings, 2005, : 136 - 140
  • [27] Clustering of XML Documents Based on Structure and Aggregated Content
    Rezk, Nermeen Gamal
    Sarhan, Amany
    Algergawy, Alsaved
    PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2016, : 93 - 102
  • [28] Clustering XML documents by structure based on common neighbor
    Zhang, XZ
    Lv, TY
    Wang, ZX
    Zuo, WL
    COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 771 - 776
  • [29] Feature- and query-based table of contents generation for XML documents
    Szlavik, Zoltan
    Tombros, Anastasios
    Lalmas, Mounia
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 456 - +
  • [30] Clustering XML documents by structure
    Dalamagas, T
    Cheng, T
    Winkel, KJ
    Sellis, T
    METHODS AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3025 : 112 - 121