Estimating confidence intervals for structural differences between contrast groups with missing data

被引:2
|
作者
Qin, Yongsong [1 ]
Zhang, Shichao [1 ,2 ]
Zhu, Xiaofeng [1 ]
Zhang, Jilian [1 ]
Zhang, Chengqi [2 ]
机构
[1] Guangxi Normal Univ, Sch Comp Sci & Informat Technol, Guilin, Peoples R China
[2] Univ Technol Sydney, Fac Informat Technol, Broadway, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Difference detection; Confidence interval; Missing data imputation; LIKELIHOOD-BASED INFERENCE; MINING CHANGES; IMPUTATION;
D O I
10.1016/j.eswa.2008.07.068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Difference detection is actual and extremely useful for evaluating a new medicine B against a specified disease by comparing to an old medicine A, which has been used to treat the disease for many years. The datasets generated by applying A and B to the disease are called contrast groups and, main differences between the groups are the mean and distribution differences, referred to structural differences in this paper. However, contrast groups are only two samples obtained by limited applications or tests on A and B, and may be with missing values. Therefore, the differences derived from the groups are inevitably uncertain. In this paper, we propose a statistically sound approach for measuring this uncertainty by identifying the confidence intervals of structural differences between contrast groups. This method is designed significantly against most of those applications whose exact data distributions are unknown a priori, and the data may also be with missing values. We apply our approach to LICI datasets to illustrate its power as a new data mining technique for, such as, distinguishing spam from non-spam emails; and the benign breast cancer from the malign one. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6431 / 6438
页数:8
相关论文
共 50 条
  • [11] Confidence intervals for the difference between two means with missing data following a preliminary test
    Paksaranuwat, Pawat
    Niwitpong, Sa-aat
    SCIENCEASIA, 2009, 35 (03): : 310 - 313
  • [12] Monte Carlo confidence intervals for the indirect effect with missing data
    Pesigan, Ivan Jacob Agaloos
    Cheung, Shu Fai
    BEHAVIOR RESEARCH METHODS, 2024, 56 (03) : 1678 - 1696
  • [13] Monte Carlo confidence intervals for the indirect effect with missing data
    Ivan Jacob Agaloos Pesigan
    Shu Fai Cheung
    Behavior Research Methods, 2024, 56 : 1678 - 1696
  • [14] Using GAs to estimate confidence intervals for missing spatial data
    Eklund, NH
    SMCIA/03: PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL WORKSHOP ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS, 2003, : 91 - 95
  • [15] Estimating DEA confidence intervals with statistical panel data analysis
    Barnum, Darold T.
    Gleason, John M.
    Karlaftis, Matthew G.
    Schumock, Glen T.
    Shields, Karen L.
    Tandon, Sonali
    Walton, Surrey M.
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (04) : 815 - 828
  • [16] Wilson Confidence Intervals for Binomial Proportions With Multiple Imputation for Missing Data
    Lott, Anne
    Reiter, Jerome P.
    AMERICAN STATISTICIAN, 2020, 74 (02): : 109 - 115
  • [17] Confidence intervals for a common mean with missing data with applications in an AIDS study
    Liang, Hua
    Su, Haiyan
    Zou, Guohua
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 53 (02) : 546 - 553
  • [18] Empirical Likelihood Confidence Intervals for Response Mean with Data Missing at Random
    Xue, Liugen
    SCANDINAVIAN JOURNAL OF STATISTICS, 2009, 36 (04) : 671 - 685
  • [19] Using genetic algorithms to estimate confidence intervals for missing spatial data
    Eklund, Neil H. W.
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2006, 36 (04): : 519 - 524
  • [20] A GEE approach to estimating accuracy and its confidence intervals for correlated data
    Lim, Yaeji
    PHARMACEUTICAL STATISTICS, 2020, 19 (01) : 59 - 70