Differentially Private k-Nearest Neighbor Missing Data Imputation

被引:4
|
作者
Clifton, Chris [1 ]
Hanson, Eric J. [2 ]
Merrill, Keith [3 ]
Merrill, Shawn [1 ]
机构
[1] Purdue Univ, 305 N Univ St, W Lafayette, IN 47906 USA
[2] Univ Quebec Montreal, Lab Combinatoire & Informat Math, Montreal, PQ H3C 3P8, Canada
[3] Brandeis Univ, 415 South St, Waltham, MA 02453 USA
关键词
Differential privacy; statistical disclosure limitation; private data cleaning; smooth sensitivity;
D O I
10.1145/3507952
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using techniques employing smooth sensitivity, we develop a method for k-nearest neighbor missing data imputation with differential privacy. This requires bounding the number of data incomplete tuples that can have their data complete "donor" changed by making a single addition or deletion to the dataset. The multiplicity of a single individual's impact on an imputed dataset necessarily means our mechanisms require the addition of more noise than mechanisms that ignore missing data, but we show empirically that this is significantly outweighed by the bias reduction from imputing missing data.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Analysis of the k-nearest neighbor classification
    Li, Jing
    Cheng, Ming
    INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1911 - 1917
  • [42] Weighted K-Nearest Neighbor Revisited
    Bicego, M.
    Loog, M.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1642 - 1647
  • [43] Efficient and secure k-nearest neighbor query on outsourced data
    Huijuan Lian
    Weidong Qiu
    Di Yan
    Zheng Huang
    Peng Tang
    Peer-to-Peer Networking and Applications, 2020, 13 : 2324 - 2333
  • [44] Microarray Data Classification using Fuzzy K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Santanu Ku
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1032 - 1038
  • [45] A FUZZY K-NEAREST NEIGHBOR ALGORITHM
    KELLER, JM
    GRAY, MR
    GIVENS, JA
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (04): : 580 - 585
  • [46] CHROMATIC K-NEAREST NEIGHBOR QUERIES
    van der Horst, Thijs
    Loffler, Maarten
    Staals, Frank
    JOURNAL OF COMPUTATIONAL GEOMETRY, 2025, 16 (01)
  • [47] Efficient and secure k-nearest neighbor query on outsourced data
    Lian, Huijuan
    Qiu, Weidong
    Yan, Di
    Huang, Zheng
    Tang, Peng
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (06) : 2324 - 2333
  • [48] A Modified K-Nearest Neighbor Algorithm to Handle Uncertain Data
    Agrawal, Rashmi
    Ram, Babu
    2015 5TH INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2015,
  • [49] A fuzzy K-nearest neighbor classifier to deal with imperfect data
    Cadenas, Jose M.
    Carmen Garrido, M.
    Martinez, Raquel
    Munoz, Enrique
    Bonissone, Piero P.
    SOFT COMPUTING, 2018, 22 (10) : 3313 - 3330
  • [50] Consistency of the k-Nearest Neighbor Classifier for Spatially Dependent Data
    Ahmad Younso
    Ziad Kanaya
    Nour Azhari
    Communications in Mathematics and Statistics, 2023, 11 : 503 - 518