Robust Bayesian Classification with Incomplete Data

被引:0
|
作者
Xunan Zhang
Shiji Song
Cheng Wu
机构
[1] Tsinghua University,Department of Automation
来源
Cognitive Computation | 2013年 / 5卷
关键词
Bayesian classification; Incomplete data; EM algorithm; Propensity scores;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we address the Bayesian classification with incomplete data. The common approach in the literature is to simply ignore the samples with missing values or impute missing values before classification. However, these methods are not effective when a large portion of the data have missing values and the acquisition of samples is expensive. Motivated by these limitations, the expectation maximization algorithm for learning a multivariate Gaussian mixture model and a multiple kernel density estimator based on the propensity scores are proposed to avoid listwise deletion (LD) or mean imputation (MI) for solving classification tasks with incomplete data. We illustrate the effectiveness of our proposed algorithms on some artificial and benchmark UCI data sets by comparing with LD and MI methods. We also apply these algorithms to solve the practical classification tasks on the lithology identification of hydrothermal minerals and license plate character recognition. The experimental results demonstrate their good performance with high classification accuracies.
引用
收藏
页码:170 / 187
页数:17
相关论文
共 50 条
  • [21] Classification of incomplete data by observation
    Lorrentz, Pierre
    Engineering Letters, 2011, 18 (04)
  • [22] Towards a robust incomplete data handling approach to effective educational data classification in an academic credit system
    Nguyen Truc Mai Anh
    Vo Thi Ngoc Chau
    Nguyen Hua Phung
    2014 INTERNATIONAL CONFERENCE ON DATA MINING AND INTELLIGENT COMPUTING (ICDMIC), 2014,
  • [23] Robust Feature Selection on Incomplete Data
    Zheng, Wei
    Zhu, Xiaofeng
    Zhu, Yonghua
    Zhang, Shichao
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3191 - 3197
  • [24] On robust linear regression with incomplete data
    Atkinson, AC
    Cheng, TC
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 33 (04) : 361 - 380
  • [25] Estimating with incomplete count data A Bayesian approach
    J Stat Plan Inference, 1 (147):
  • [26] Incomplete categorical data analysis: A Bayesian perspective
    Soares, P
    Paulino, CD
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2001, 69 (02) : 157 - 170
  • [27] Bayesian network models for incomplete and dynamic data
    Scutari, Marco
    STATISTICA NEERLANDICA, 2020, 74 (03) : 397 - 419
  • [28] Estimating with incomplete count data A Bayesian approach
    Moreno, E
    Giron, J
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1998, 66 (01) : 147 - 159
  • [29] Bayesian selection of decomposable models with incomplete data
    Sebastiani, P
    Ramoni, M
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1375 - 1386
  • [30] Bayesian network induction with incomplete private data
    Zhan, J
    Chang, LW
    Matwin, S
    SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1119 - 1124