SAT-based Decision Tree Learning for Large Data Sets

被引:0
|
作者
Schidler, Andre [1 ]
Szeider, Stefan [1 ]
机构
[1] TU Wien, Algorithms & Complex Grp, Vienna, Austria
来源
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021年 / 35卷
基金
奥地利科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision trees of low depth are beneficial for understanding and interpreting the data they represent. Unfortunately, finding a decision tree of lowest depth that correctly represents given data is NP-hard. Hence known algorithms either (i) utilize heuristics that do not optimize the depth or (ii) are exact but scale only to small or medium-sized instances. We propose a new hybrid approach to decision tree learning, combining heuristic and exact methods in a novel way. More specifically, we employ SAT encodings repeatedly to local parts of a decision tree provided by a standard heuristic, leading to a global depth improvement. This allows us to scale the power of exact SAT-based methods to almost arbitrarily large data sets. We evaluate our new approach experimentally on a range of real-world instances that contain up to several thousand samples. In almost all cases, our method successfully decreases the depth of the initial decision tree; often, the decrease is significant.
引用
收藏
页码:3904 / 3912
页数:9
相关论文
共 50 条
  • [41] Privacy Preserving Decision Tree Learning Using Unrealized Data Sets
    Fong, Pui K.
    Weber-Jahnke, Jens H.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (02) : 353 - 364
  • [42] Incremental SAT instance generation for SAT-based ATPG
    Tille, Daniel
    Drechsler, Rolf
    2008 IEEE WORKSHOP ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, PROCEEDINGS, 2008, : 68 - 73
  • [43] Choosing Decision Tree-Based Boundary Patterns in the Intrusion Detection Systems with Large Data Sets
    Ghaffari, Hamidreza
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (03) : 363 - 369
  • [44] Ranking with Multiple Reference Points: Efficient SAT-based learning procedures
    Belahcene, Khaled
    Mousseau, Vincent
    Ouerdane, Wassila
    Pirlot, Marc
    Sobrie, Olivier
    COMPUTERS & OPERATIONS RESEARCH, 2023, 150
  • [45] SAT-based cooperative planning: A proposal
    Benedetti, M
    Aiello, LC
    MECHANIZING MATHEMATICAL REASONING: ESSAYS IN HONOUR OF JORG H SIEKMANN ON THE OCCASION OF HIS 60TH BIRTHDAY, 2005, 2605 : 494 - 513
  • [46] SAT-Based verification of LTL formulas
    Zhang, Wenhui
    FORMAL METHODS: APPLICATIONS AND TECHNOLOGY, 2007, 4346 : 277 - 292
  • [47] SAT-Based Minimization of Deterministic ω-Automata
    Baarir, Souheib
    Duret-Lutz, Alexandre
    LOGIC FOR PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND REASONING, (LPAR-20 2015), 2015, 9450 : 79 - 87
  • [48] A SAT-based algorithm for context matching
    Bouquet, P
    Magnini, B
    Serafini, L
    Zanobini, S
    MODELING AND USING CONTEXT, PROCEEDINGS, 2003, 2680 : 66 - 79
  • [49] The SAT-based Approach to Separation Logic
    Alessandro Armando
    Claudio Castellini
    Enrico Giunchiglia
    Marco Maratea
    Journal of Automated Reasoning, 2005, 35 : 237 - 263
  • [50] SATMCS: An Efficient SAT-Based Algorithm and Its Improvements for Computing Minimal Cut Sets
    Luo, Weilin
    Wei, Ou
    Wan, Hai
    IEEE TRANSACTIONS ON RELIABILITY, 2021, 70 (02) : 575 - 589