Estimating confidence intervals for structural differences between contrast groups with missing data

Cited by: 2
Authors
Qin, Yongsong [1]
Zhang, Shichao [1, 2]
Zhu, Xiaofeng [1]
Zhang, Jilian [1]
Zhang, Chengqi [2]
Affiliations
[1] Guangxi Normal Univ, Sch Comp Sci & Informat Technol, Guilin, Peoples R China
[2] Univ Technol Sydney, Fac Informat Technol, Broadway, NSW 2007, Australia
Funding
Australian Research Council
Keywords
Difference detection; Confidence interval; Missing data imputation; LIKELIHOOD-BASED INFERENCE; MINING CHANGES; IMPUTATION;
DOI
10.1016/j.eswa.2008.07.068
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Difference detection is a timely and highly useful task, for example when evaluating a new medicine B for a given disease by comparing it with an old medicine A that has been used to treat the disease for many years. The datasets generated by applying A and B to the disease are called contrast groups. The main differences between the groups are the differences in their means and in their distributions, referred to in this paper as structural differences. However, contrast groups are only two samples, obtained from a limited number of applications or tests of A and B, and they may contain missing values. The differences derived from the groups are therefore inevitably uncertain. In this paper, we propose a statistically sound approach for measuring this uncertainty by identifying confidence intervals for the structural differences between contrast groups. The method targets the many applications in which the exact data distribution is unknown a priori and the data may contain missing values. We apply our approach to UCI datasets to illustrate its power as a new data mining technique for tasks such as distinguishing spam from non-spam emails and benign from malignant breast cancer. (C) 2008 Elsevier Ltd. All rights reserved.
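To make the idea of a confidence interval for a mean difference under missing data concrete, the following is a minimal sketch only. The abstract does not state which estimator or imputation scheme the paper uses, so this example swaps in a plain percentile bootstrap combined with random hot-deck imputation; it should not be read as the authors' method. The function names `hot_deck_impute` and `bootstrap_mean_diff_ci`, the use of `None` as the missing-value marker, and the toy numbers in the usage note are all assumptions made for illustration.

```python
import random
import statistics


def hot_deck_impute(sample, donors, rng):
    """Replace each missing entry (None) with a random draw from `donors`."""
    return [x if x is not None else rng.choice(donors) for x in sample]


def bootstrap_mean_diff_ci(group_a, group_b, level=0.95, n_boot=2000, seed=0):
    """Percentile bootstrap interval for mean(B) - mean(A).

    Illustrative assumption: missing values are marked with None and are
    filled by random hot-deck imputation from the observed values of the
    same group. No distributional form is assumed for either group.
    """
    rng = random.Random(seed)
    donors_a = [x for x in group_a if x is not None]
    donors_b = [x for x in group_b if x is not None]
    if not donors_a or not donors_b:
        raise ValueError("each group needs at least one observed value")

    diffs = []
    for _ in range(n_boot):
        # Resample each group with replacement (missing markers included),
        # then impute inside the replicate so that imputation noise also
        # contributes to the spread of the bootstrap differences.
        rep_a = hot_deck_impute(rng.choices(group_a, k=len(group_a)), donors_a, rng)
        rep_b = hot_deck_impute(rng.choices(group_b, k=len(group_b)), donors_b, rng)
        diffs.append(statistics.fmean(rep_b) - statistics.fmean(rep_a))

    diffs.sort()
    alpha = 1.0 - level
    lower = diffs[int((alpha / 2) * (n_boot - 1))]
    upper = diffs[int((1 - alpha / 2) * (n_boot - 1))]
    return lower, upper
```

A call such as `bootstrap_mean_diff_ci([5.1, None, 4.8, 5.5], [6.0, 6.3, None, 5.9])` returns a `(lower, upper)` pair for the mean difference of B over A. An interval that excludes zero would suggest a difference in means at the chosen level, subject to the usual caveats of bootstrap intervals on small samples.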
Pages: 6431-6438
Page count: 8