A new approach for data editing and imputation

Cited by: 0
Authors
Sergio Delgado-Quintero
Juan-José Salazar-González
Affiliation
[1] Universidad de La Laguna, DEIOC
Keywords
Editing; Imputation; Error localization problem; Mathematical programming; Heuristics
DOI
Not available
Abstract
The editing-and-imputation problem concerns finding errors in a record that does not satisfy a set of consistency rules (edits). Once potential errors have been localized, new values must be imputed to the associated fields. The output dataset should consist of valid records and preserve statistical properties similar to those of the input dataset. Most of this work is usually done manually by statistical agencies, consuming a great deal of human resources. This paper presents a mathematical programming model to optimally solve the problem on surveys with categorical values and particular edits. We also describe a heuristic approach to deal with more complex surveys. The heuristic procedure combines the widely accepted hot-deck donor scheme with multivariate regression analysis. It has been implemented in a graphical user interface running on standard personal computers and has been tested on real-world surveys. This paper demonstrates the satisfactory performance of our automatic procedure.
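The two steps named in the abstract, error localization and donor-based imputation, can be illustrated on a toy categorical survey. The sketch below is not the paper's model: it replaces the mathematical programming formulation with plain brute-force enumeration over field subsets, it omits the regression component of the heuristic, and the fields, domains, edits, and function names (`DOMAINS`, `EDITS`, `localize_errors`, `hot_deck_impute`) are all hypothetical choices made for illustration.

```python
from itertools import combinations, product

# Hypothetical toy survey: each field and its allowed categories.
DOMAINS = {
    "age": ["child", "adult"],
    "marital": ["single", "married"],
    "employed": ["no", "yes"],
}

# Hypothetical edits: each function returns True when the record is consistent.
EDITS = [
    lambda r: not (r["age"] == "child" and r["marital"] == "married"),
    lambda r: not (r["age"] == "child" and r["employed"] == "yes"),
]


def is_valid(record):
    """A record is valid when it satisfies every edit."""
    return all(edit(record) for edit in EDITS)


def localize_errors(record):
    """Return a smallest set of fields that can be changed to satisfy all edits.

    Brute force: try subsets of increasing size and every combination of
    category values for the chosen fields. Only practical for tiny surveys.
    """
    fields = list(DOMAINS)
    for size in range(len(fields) + 1):
        for subset in combinations(fields, size):
            for values in product(*(DOMAINS[f] for f in subset)):
                candidate = {**record, **dict(zip(subset, values))}
                if is_valid(candidate):
                    return set(subset)
    return None  # no assignment over these domains satisfies the edits


def hot_deck_impute(record, suspect_fields, donors):
    """Copy the suspect fields from the donor that agrees with the record on
    the most trusted fields, keeping only results that pass every edit."""
    trusted = [f for f in DOMAINS if f not in suspect_fields]
    best, best_score = None, -1
    for donor in donors:
        candidate = {**record, **{f: donor[f] for f in suspect_fields}}
        if not is_valid(candidate):
            continue
        score = sum(donor[f] == record[f] for f in trusted)
        if score > best_score:
            best, best_score = candidate, score
    return best
```

For example, a record with `age="child"`, `marital="married"`, `employed="yes"` violates both edits, and changing `age` alone is enough to repair it, so `localize_errors` returns `{"age"}`; `hot_deck_impute` then takes the `age` value from the valid donor that best matches the remaining fields.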
Related papers
  • [1] A new approach for data editing and imputation
    Delgado-Quintero, Sergio
    Salazar-Gonzalez, Juan-Jose
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 68 (03) : 407 - 428
  • [2] New methods of editing and imputation
    Nordbotten, S
    AGRICULTURAL STATISTICS 2000, PROCEEDINGS: AN INTERNATIONAL CONFERENCE ON AGRICULTURAL STATISTICS, 1998, : 220 - 228
  • [3] EDITING AND IMPUTATION FOR QUANTITATIVE SURVEY DATA
    LITTLE, RJA
    SMITH, PJ
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (397) : 58 - 68
  • [4] Handbook of Statistical Data Editing and Imputation
    Kosler, Joseph S.
    JOURNAL OF OFFICIAL STATISTICS, 2013, 29 (02) : 324 - 327
  • [5] A New Approach for Missing Data Imputation in Big Data Interface
    Wang, Chunzhi
    Shakhovska, Nataliya
    Sachenko, Anatoliy
    Komar, Myroslav
    INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (04) : 541 - 555
  • [6] Data Editing and Imputation in Business Surveys Using "R"
    Romascanu, Elena
    ROMANIAN STATISTICAL REVIEW, 2014, (02) : 129 - 146
  • [7] A new approach to imputation
    Cohen, MP
    AMERICAN STATISTICAL ASSOCIATION - 1996 PROCEEDINGS OF THE SECTION ON SURVEY RESEARCH METHODS, VOLS I AND II, 1996, : 293 - 298
  • [8] A Probabilistic Approach for Missing Data Imputation
    Arefin, Muhammed Nazmul
    Masum, Abdul Kadar Muhammad
    COMPLEXITY, 2024, 2024
  • [9] DATA IMPUTATION: AN OPTIMIZATION APPROACH
    Cooley, Philip C.
    International Journal on Policy and Information, 1987, 11 (01) : 39 - 45
  • [10] Combining editing and imputation methods: an experimental application on population census data
    Manzari, A
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2004, 167 : 295 - 307