Sequential estimate for linear regression models with uncertain number of effective variables

Cited by: 8
Authors
Wang, Zhanfeng [1]
Chang, Yuan-chin Ivan [2]
Affiliations
[1] Univ Sci & Technol China, Dept Stat & Finance, Hefei 230026, Peoples R China
[2] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan
Funding
National Natural Science Foundation of China;
Keywords
Confidence set; Shrinkage estimation; Stochastic regression; Stopping time;
DOI
10.1007/s00184-012-0426-4
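Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes
020208; 070103; 0714;
Abstract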
As a result of novel data collection technologies, it is now common to encounter data in which the number of explanatory variables collected is large, while the number of variables that actually contribute to the model remains small. A method that can identify the variables that affect the model, without drawing inference on the noneffective ones, would therefore make analysis much more efficient. Many methods have been proposed to resolve model selection problems under such circumstances; however, it is still unknown how large a sample is sufficient to identify those "effective" variables. In this paper, we apply a sequential sampling method so that the effective variables can be identified efficiently, and sampling is stopped as soon as the "effective" variables are identified and their corresponding regression coefficients are estimated with satisfactory accuracy, which is new to sequential estimation. Both fixed and adaptive designs are considered. The asymptotic properties of the estimates of the number of effective variables and of their coefficients are established, and the proposed sequential estimation procedure is shown to be asymptotically optimal. Simulation studies are conducted to illustrate the performance of the proposed estimation method, and a diabetes data set is used as an example.
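The general idea of stopping the sampling once the selected coefficients are estimated accurately enough can be illustrated with a minimal sketch. This is not the authors' procedure: the hard-threshold selection rule, the normal-approximation confidence half-width, and all names (`sequential_ols`, `draw`, `d`, `threshold`, `n0`, `max_n`) are assumptions made only for illustration.

```python
import numpy as np

def sequential_ols(draw, d, threshold, n0=20, max_n=5000, z=1.96):
    """Illustrative stopping rule (an assumption, not the paper's method).

    draw() returns one observation (x, y). Sampling stops at the first n >= n0
    where every coefficient flagged as "effective" has an approximate
    confidence half-width of at most d. n0 must exceed the number of variables.
    """
    rows, ys = [], []
    selected, beta = np.array([], dtype=int), None
    for n in range(1, max_n + 1):
        x, y = draw()
        rows.append(x)
        ys.append(y)
        if n < n0:
            continue  # pilot stage: collect an initial sample first
        X, Y = np.asarray(rows), np.asarray(ys)
        # Ordinary least squares fit on all observations so far
        beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
        resid = Y - X @ beta
        sigma2 = resid @ resid / (n - X.shape[1])
        cov = sigma2 * np.linalg.inv(X.T @ X)
        # Hypothetical selection rule: keep coefficients above a fixed threshold
        selected = np.flatnonzero(np.abs(beta) > threshold)
        half = z * np.sqrt(np.diag(cov)[selected])
        if selected.size > 0 and half.max() <= d:
            return n, selected, beta
    return max_n, selected, beta
```

Under these assumptions, a caller supplies `draw` as a function that produces one new observation at a time, and the returned triple gives the stopping sample size, the indices of the variables flagged as effective, and the final coefficient estimates.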
Pages: 949-978
Number of pages: 30
Related papers
50 in total
  • [1] Sequential estimate for linear regression models with uncertain number of effective variables
    Zhanfeng Wang
    Yuan-chin Ivan Chang
    Metrika, 2013, 76 : 949 - 978
  • [2] Sequential Estimate for Generalized Linear Models with Uncertain Number of Effective Variables
    LU Haibo
    WANG Zhanfeng
    WU Yaohua
    Journal of Systems Science & Complexity, 2015, 28 (02) : 424 - 438
  • [3] Sequential estimate for generalized linear models with uncertain number of effective variables
    Lu Haibo
    Wang Zhanfeng
    Wu Yaohua
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2015, 28 (02) : 424 - 438
  • [4] Sequential estimate for generalized linear models with uncertain number of effective variables
    Haibo Lu
    Zhanfeng Wang
    Yaohua Wu
    Journal of Systems Science and Complexity, 2015, 28 : 424 - 438
  • [5] LINEAR REGRESSION CRITERION FOR ESTIMATE OF NUMBER OF FACTORS
    KASHIWAGI, S
    ISHIZUKA, T
    JAPANESE JOURNAL OF PSYCHOLOGY, 1973, 44 (04): : 167 - 178
  • [6] SELECTION OF VARIABLES IN LINEAR REGRESSION MODELS
    SAWA, T
    ECONOMETRICA, 1968, 36 (5S) : 6 - &
  • [7] An estimate of potential blueberry yield using regression models that relate the number of fruits to the number of flower buds and to climatic variables
    Salvo, Sonia
    Munoz, Carlos
    Avila, Julio
    Bustos, Jaime
    Ramirez-Valdivia, Martha
    Silva, Carolina
    Vivallo, Gabriel
    SCIENTIA HORTICULTURAE, 2012, 133 : 56 - 63
  • [8] PREDICTION PERFORMANCE AND THE NUMBER OF VARIABLES IN MULTIVARIATE LINEAR-REGRESSION
    STEERNEMAN, T
    LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1984, 237 : 118 - 129
  • [9] Joint optimization of linear and nonlinear models for sequential regression
    Fazla, Arda
    Aydin, Mustafa E.
    Kozat, Suleyman S.
    DIGITAL SIGNAL PROCESSING, 2022, 132
  • [10] AN OVERVIEW OF LINEAR STRUCTURAL MODELS IN ERRORS IN VARIABLES REGRESSION
    Gillard, Jonathan
    REVSTAT-STATISTICAL JOURNAL, 2010, 8 (01) : 57 - 80