Publishing Time-Series Data under Preservation of Privacy and Distance Orders

被引:0
|
作者
Moon, Yang-Sae [1 ]
Kim, Hea-Suk [1 ]
Kim, Sang-Pil [1 ]
Bertino, Elisa [2 ]
机构
[1] Kangwon Natl Univ, Dept Comp Sci, Chunchon, Kangweon, South Korea
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
关键词
data mining; time-series data; privacy preservation; similarity search; data perturbation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we address the problem of preserving mining accuracy as well as privacy in publishing sensitive time-series data. For example, people with heart disease do not want to disclose their electrocardiogram time-series, but they still allow mining of some accurate patterns from their time-series. Based on this observation, we introduce the related assumptions and requirements. We show that only randomization methods satisfy all assumptions, but even those methods do not satisfy the requirements. Thus, we discuss the randomization-based solutions that satisfy all assumptions and requirements. For this purpose, we use the noise averaging effect of piecewise aggregate approximation (PAA), which may alleviate the problem of destroying distance orders in randomly perturbed time-series. Based on the noise averaging effect, we first propose two naive solutions that use the random data perturbation in publishing time-series while exploiting the PAA distance in computing distances. There is, however, a tradeoff between these two solutions with respect to uncertainty and distance orders. We thus propose two more advanced solutions that take advantages of both naive solutions. Experimental results show that our advanced solutions are superior to the naive solutions.
引用
收藏
页码:17 / +
页数:3
相关论文
共 50 条
  • [41] Privacy Preservation for Attribute Order Sensitive Workload in Medical Data Publishing
    Gao Ai-qiang
    Diao Lu-hong
    2009 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE & EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2009, : 1140 - +
  • [42] Publishing histograms with outliers under data differential privacy
    Han, Qilong
    Shao, Bo
    Li, Lijie
    Ma, Zhiqiang
    Zhang, Haitao
    Du, Xiaojiang
    SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (14) : 2313 - 2322
  • [43] Fuzzy data mining for time-series data
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    APPLIED SOFT COMPUTING, 2012, 12 (01) : 536 - 542
  • [44] Design of a Distance Learning Supervision System Based on Time-Series Data of Learning Behaviors
    Zhu, Yan
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2025, 29 (02) : 337 - 348
  • [45] Weighted z-Distance-Based Clustering and Its Application to Time-Series Data
    Wang, Zhao-Yu
    Wu, Chen-Yu
    Lin, Yan-Ting
    Lee, Shie-Jue
    APPLIED SCIENCES-BASEL, 2019, 9 (24):
  • [46] Prefix-querying with an LI distance metric for time-series subsequence matching under time warping
    Park, Sanghyun
    Kim, Sang-Wook
    JOURNAL OF INFORMATION SCIENCE, 2006, 32 (05) : 387 - 399
  • [47] Spectral analysis of time-series data
    Gregson, RAM
    CONTEMPORARY PSYCHOLOGY-APA REVIEW OF BOOKS, 1999, 44 (04): : 306 - 309
  • [48] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [49] MEASURING INSTABILITY OF TIME-SERIES DATA
    CUDDY, JDA
    DELLAVALLE, PA
    OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 1978, 40 (01) : 79 - 85
  • [50] MEASURING THE INSTABILITY OF TIME-SERIES DATA
    DUGGAN, JE
    OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 1979, 41 (03) : 239 - 246