Differentially Private k-Nearest Neighbor Missing Data Imputation

被引:4
|
作者
Clifton, Chris [1 ]
Hanson, Eric J. [2 ]
Merrill, Keith [3 ]
Merrill, Shawn [1 ]
机构
[1] Purdue Univ, 305 N Univ St, W Lafayette, IN 47906 USA
[2] Univ Quebec Montreal, Lab Combinatoire & Informat Math, Montreal, PQ H3C 3P8, Canada
[3] Brandeis Univ, 415 South St, Waltham, MA 02453 USA
关键词
Differential privacy; statistical disclosure limitation; private data cleaning; smooth sensitivity;
D O I
10.1145/3507952
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using techniques employing smooth sensitivity, we develop a method for k-nearest neighbor missing data imputation with differential privacy. This requires bounding the number of data incomplete tuples that can have their data complete "donor" changed by making a single addition or deletion to the dataset. The multiplicity of a single individual's impact on an imputed dataset necessarily means our mechanisms require the addition of more noise than mechanisms that ignore missing data, but we show empirically that this is significantly outweighed by the bias reduction from imputing missing data.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Improved k-nearest neighbor classification
    Wu, YQ
    Ianakiev, K
    Govindaraju, V
    PATTERN RECOGNITION, 2002, 35 (10) : 2311 - 2318
  • [32] Navigating K-Nearest Neighbor Graphs to Solve Nearest Neighbor Searches
    Chavez, Edgar
    Sadit Tellez, Eric
    ADVANCES IN PATTERN RECOGNITION, 2010, 6256 : 270 - 280
  • [33] A Centroid k-Nearest Neighbor Method
    Zhang, Qingjiu
    Sun, Shiliang
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 278 - 285
  • [34] Consistency of the k-Nearest Neighbor Classifier for Spatially Dependent Data
    Younso, Ahmad
    Kanaya, Ziad
    Azhari, Nour
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2023, 11 (03) : 503 - 518
  • [35] Validation of k-Nearest Neighbor Classifiers
    Bax, Eric
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (05) : 3225 - 3234
  • [36] Quantum K-nearest neighbor algorithm
    Chen, Hanwu
    Gao, Yue
    Zhang, Jun
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2015, 45 (04): : 647 - 651
  • [37] Effective k-nearest neighbor models for data classification enhancement
    Ali A. Amer
    Sri Devi Ravana
    Riyaz Ahamed Ariyaluran Habeeb
    Journal of Big Data, 12 (1)
  • [38] A fuzzy K-nearest neighbor classifier to deal with imperfect data
    Jose M. Cadenas
    M. Carmen Garrido
    Raquel Martínez
    Enrique Muñoz
    Piero P. Bonissone
    Soft Computing, 2018, 22 : 3313 - 3330
  • [39] K-Nearest Neighbor Classifier for Uncertain Data in Feature Space
    Lim, Sung-Yeon
    Ko, Changwan
    Jeong, Young-Seon
    Baek, Jaeseung
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2023, 22 (04): : 414 - 421