Reasoning about integrity constraints for tree-structured data

被引:0
|
作者
Wojciech Czerwiński
Claire David
Filip Murlak
Paweł Parys
机构
[1] University of Warsaw,
[2] Université Paris-Est Marne-la-Vallée,undefined
来源
关键词
Data trees; Integrity constraints; Unions of conjunctive queries; Schema mappings; Entailment; Containment; Consistency;
D O I
暂无
中图分类号
学科分类号
摘要
We study a class of integrity constraints for tree-structured data modelled as data trees, whose nodes have a label from a finite alphabet and store a data value from an infinite data domain. The constraints require each tuple of nodes selected by a conjunctive query (using navigational axes and labels) to satisfy a positive combination of equalities and a positive combination of inequalities over the stored data values. Such constraints are instances of the general framework of XML-to-relational constraints proposed recently by Niewerth and Schwentick. They cover some common classes of constraints, including W3C XML Schema key and unique constraints, as well as domain restrictions and denial constraints, but cannot express inclusion constraints, such as reference keys. Our main result is that consistency of such integrity constraints with respect to a given schema (modelled as a tree automaton) is decidable. An easy extension gives decidability for the entailment problem. Equivalently, we show that validity and containment of unions of conjunctive queries using navigational axes, labels, data equalities and inequalities is decidable, as long as none of the conjunctive queries uses both equalities and inequalities; without this restriction, both problems are known to be undecidable. In the context of XML data exchange, our result can be used to establish decidability for a consistency problem for XML schema mappings. All the decision procedures are doubly exponential, with matching lower bounds. The complexity may be lowered to singly exponential, when conjunctive queries are replaced by tree patterns, and the number of data comparisons is bounded.
引用
收藏
页码:941 / 976
页数:35
相关论文
共 50 条
  • [1] Reasoning about integrity constraints for tree-structured data
    Czerwinski, Wojciech
    David, Claire
    Murlak, Filip
    Parys, Pawel
    THEORY OF COMPUTING SYSTEMS, 2018, 62 (04) : 941 - 976
  • [2] A tree-covering problem arising in integrity of tree-structured data
    Atallah, Mikhail J.
    Frederickson, Greg N.
    Kundu, Ashish
    INFORMATION PROCESSING LETTERS, 2008, 109 (01) : 79 - 82
  • [3] Clustering of Tree-structured Data
    Lu, Na
    Wu, Yidan
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 1210 - 1215
  • [4] Watermarking abstract tree-structured data
    Chen, G
    Chen, K
    Hu, TL
    Dong, JX
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 221 - 232
  • [5] OODA of graph and tree-structured data
    Sienkiewicz, Ela
    Wang, Haonan
    BIOMETRICAL JOURNAL, 2014, 56 (05) : 778 - 780
  • [6] Anonymizing Collections of Tree-Structured Data
    Gkountouna, Olga
    Terrovitis, Manolis
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1520 - 1521
  • [7] Tree-structured Clustering for Continuous Data
    Huh, Myung-Hoe
    Yang, Kyung-Sook
    KOREAN JOURNAL OF APPLIED STATISTICS, 2005, 18 (03) : 661 - 671
  • [8] Anonymizing Collections of Tree-Structured Data
    Gkountouna, Olga
    Terrovitis, Manolis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (08) : 2034 - 2048
  • [9] Substructure search with tree-structured data
    Ozawa, K
    Yasuda, T
    Fujita, S
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (04): : 688 - 695
  • [10] SQL-like query language and referential constraints on tree-structured data
    Afrati, Foto N.
    Damigos, Matthew George
    Stasinopoulos, Nikos
    IDEAS 2021: 25TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, 2021, : 1 - 10