Model-Assisted Estimation Through Random Forests in Finite Population Sampling

Cited by: 19
Authors
Dagdoug, Mehdi [1]
Goga, Camelia [1]
Haziza, David [2]
Affiliations
[1] Univ Bourgogne Franche Comte, Lab Math Besancon, Besancon, France
[2] Univ Ottawa, Dept Math & Stat, 150 Louis Pasteur Private, Ottawa, ON K1N 6N5, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Model-assisted approach; Model-calibration; Nonparametric regression; Random forest; Survey data; Variance estimation; ASYMPTOTIC CONFIDENCE BANDS; AUXILIARY INFORMATION; VARIANCE REDUCTION; SURVEY DESIGN; APPROXIMATION;
DOI
10.1080/01621459.2021.1987250
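Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract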
In surveys, the interest lies in estimating finite population parameters such as population totals and means. In most surveys, some auxiliary information is available at the estimation stage. This information may be incorporated in the estimation procedures to increase their precision. In this article, we use random forests (RFs) to estimate the functional relationship between the survey variable and the auxiliary variables. In recent years, RFs have become attractive as National Statistical Offices now have access to a variety of data sources, potentially exhibiting a large number of observations on a large number of variables. We establish the theoretical properties of model-assisted procedures based on RFs and derive corresponding variance estimators. A model-calibration procedure for handling multiple survey variables is also discussed. The results of a simulation study suggest that the proposed point and variance estimation procedures perform well in terms of bias, efficiency, and coverage of normal-based confidence intervals, in a wide variety of settings. Finally, we apply the proposed methods using data on radio audiences collected by Mediametrie, a French audience company. Supplementary materials for this article are available online.
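For orientation, a model-assisted estimator of a population total is commonly written in the generalized difference form sketched below. The notation is an assumption introduced here and does not appear in the abstract: U is the finite population, s the sample, \pi_k the first-order inclusion probability of unit k, y_k the survey variable, x_k the auxiliary vector, and \widehat{m}_{rf} a random forest prediction fitted on the sample.

```latex
% Sketch of a model-assisted (generalized difference) estimator of the
% total t_y = \sum_{k \in U} y_k. All symbols are assumed notation,
% not taken from the abstract.
\[
  \widehat{t}_{rf}
  \;=\; \sum_{k \in U} \widehat{m}_{rf}(\mathbf{x}_k)
  \;+\; \sum_{k \in s} \frac{y_k - \widehat{m}_{rf}(\mathbf{x}_k)}{\pi_k}
\]
```

The first sum aggregates the predictions over the whole population, which requires the auxiliary variables to be known for every unit; the second is a design-weighted sum of sample residuals that corrects for prediction error.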
Pages: 1234-1251
Page count: 18
Related papers
50 in total
  • [41] Finite-population variance estimation under systematic sampling schemes with multiple random starts
    Sampath, S.
    Ammani, S.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2012, 82 (08) : 1207 - 1221
  • [42] Estimation of finite population mean in simple and stratified random sampling using two auxiliary variables
    Shabbir, Javid
    Gupta, Sat
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (20) : 10135 - 10148
  • [43] MODEL-ASSISTED AND MODEL-CALIBRATED ESTIMATION FOR CLASS FREQUENCIES WITH ORDINAL OUTCOMES
    del Mar Rueda, Maria
    Arcos, Antonio
    Molina, David
    Trujillo, Manuel
    REVSTAT-STATISTICAL JOURNAL, 2018, 16 (03) : 323 - 348
  • [44] On Random Sampling Without Replacement from a Finite Population
    Subhash C. Kochar
    Ramesh Korwar
    Annals of the Institute of Statistical Mathematics, 2001, 53 : 631 - 646
  • [45] On random sampling without replacement from a finite population
    Kochar, SC
    Korwar, R
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2001, 53 (03) : 631 - 646
  • [46] Model-assisted estimation of forest resources with generalized additive models - Rejoinder
    Opsomer, Jean D.
    Breidt, F. Jay
    Moisen, Gretchen G.
    Kauermann, Göran
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (478) : 415 - 416
  • [47] Network model-assisted inference from respondent-driven sampling data
    Gile, Krista J.
    Handcock, Mark S.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2015, 178 (03) : 619 - 639
  • [48] Model-Assisted Estimation of Tropical Forest Biomass Change: A Comparison of Approaches
    Knapp, Nikolai
    Huth, Andreas
    Kugler, Florian
    Papathanassiou, Konstantinos
    Condit, Richard
    Hubbell, Stephen P.
    Fischer, Rico
    REMOTE SENSING, 2018, 10 (05)
  • [49] Screening for Cognitive Impairment by Model-Assisted Cerebral Blood Flow Estimation
    Lassila, Toni
    Di Marco, Luigi Yuri
    Mitolo, Micaela
    Iaia, Vincenzo
    Levedianos, Giorgio
    Venneri, Annalena
    Frangi, Alejandro F.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2018, 65 (07) : 1654 - 1661
  • [50] Model-assisted estimation of forest resources with generalized additive models - Comment
    Christman, Mary C.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (478) : 411 - 412