The use of uncertainty to choose matching variables in statistical matching

被引:4
|
作者
D'Orazio, Marcello [1 ]
Di Zio, Marco [2 ]
Scanu, Mauro [2 ]
机构
[1] UN, FAO, Rome, Italy
[2] Ist Nazl Stat ISTAT, Rome, Italy
关键词
Data fusion; Synthetical matching; Consistency; Partial identifiability; PARTIALLY IDENTIFIED PARAMETERS; CONFIDENCE-INTERVALS;
D O I
10.1016/j.ijar.2017.08.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Statistical matching aims at combining information available in distinct sample surveys referred to the same target population. The matching is usually based on a set of common variables shared by the available data sources. For matching purposes just a subset of all the common variables should be chosen, the so called matching variables. The paper presents a novel method for selecting the matching variables based on the analysis of the uncertainty characterizing the matching. framework. The uncertainty is caused by unavailability of data for estimating parameters describing the association between variables not jointly observed in a single data source. The paper focuses on the case of categorical variables and presents a sequential procedure for identifying the most effective subset of common variables in reducing the overall uncertainty. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:433 / 440
页数:8
相关论文
共 50 条
  • [1] The Use of Uncertainty to Choose Matching Variables in Statistical Matching
    D'Orazio, Marcello
    Di Zio, Marco
    Scanu, Mauro
    SOFT METHODS FOR DATA SCIENCE, 2017, 456 : 149 - 156
  • [2] Uncertainty analysis for statistical matching of ordered categorical variables
    Conti, Pier Luigi
    Marella, Daniela
    Scanu, Mauro
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 68 : 311 - 325
  • [3] Uncertainty Analysis in Statistical Matching
    Conti, Pier Luigi
    Marella, Daniela
    Scanu, Mauro
    JOURNAL OF OFFICIAL STATISTICS, 2012, 28 (01) : 69 - 88
  • [4] Matching Patterns with Variables
    Manea, Florin
    Schmid, Markus L.
    COMBINATORICS ON WORDS, WORDS 2019, 2019, 11682 : 1 - 27
  • [5] Statistical matching and uncertainty analysis in combining household income and expenditure data
    Pier Luigi Conti
    Daniela Marella
    Andrea Neri
    Statistical Methods & Applications, 2017, 26 : 485 - 505
  • [6] Statistical matching and uncertainty analysis in combining household income and expenditure data
    Conti, Pier Luigi
    Marella, Daniela
    Neri, Andrea
    STATISTICAL METHODS AND APPLICATIONS, 2017, 26 (03): : 485 - 505
  • [7] Bronchial Asthma and COPD choose Matching Potency
    PNEUMOLOGIE, 2015, 69 (08): : 498 - 498
  • [8] Recovering Latent Variables by Matching
    Arellano, Manuel
    Bonhomme, Stephane
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (541) : 693 - 706
  • [9] ACCURACY IN THE APPLICATION OF STATISTICAL MATCHING METHODS FOR CONTINUOUS VARIABLES USING AUXILIARY DATA
    Van Delden, Arnout
    Du Chatinier, Bart J.
    Scholtus, Sander
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2020, 8 (05) : 990 - 1017
  • [10] STATUS VARIABLES AND MATCHING BEHAVIOR
    DECHARMS, R
    ROSENBAUM, ME
    JOURNAL OF PERSONALITY, 1960, 28 (04) : 492 - 502