Private Data Analytics on Biomedical Sensing Data via Distributed Computation

被引:30
|
作者
Gong, Yanmin [1 ]
Fang, Yuguang [1 ]
Guo, Yuanxiong [2 ]
机构
[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
[2] Oklahoma State Univ, Sch Elect & Comp Engn, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
Private data analytics; mobile health; predictive model training; logistic regression; LOGISTIC-REGRESSION; ANONYMIZATION; CARE;
D O I
10.1109/TCBB.2016.2515610
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Advances in biomedical sensors and mobile communication technologies have fostered the rapid growth of mobile health (mHealth) applications in the past years. Users generate a high volume of biomedical data during health monitoring, which can be used by the mHealth server for training predictive models for disease diagnosis and treatment. However, the biomedical sensing data raise serious privacy concerns because they reveal sensitive information such as health status and lifestyles of the sensed subjects. This paper proposes and experimentally studies a scheme that keeps the training samples private while enabling accurate construction of predictive models. We specifically consider logistic regression models which are widely used for predicting dichotomous outcomes in healthcare, and decompose the logistic regression problem into small subproblems over two types of distributed sensing data, i.e., horizontally partitioned data and vertically partitioned data. The subproblems are solved using individual private data, and thus mHealth users can keep their private data locally and only upload (encrypted) intermediate results to the mHealth server for model training. Experimental results based on real datasets show that our scheme is highly efficient and scalable to a large number of mHealth users.
引用
收藏
页码:431 / 444
页数:14
相关论文
共 50 条
  • [1] Temporal biomedical data analytics
    Moskovitch, Robert
    Shahar, Yuval
    Wang, Fei
    Hripcsak, George
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 90
  • [2] Unary Computation for Biomedical Data
    Kim, Seung-Youl
    Kim, Kyo-Tae
    Cho, Kyoung-Rok
    Cho, Tae Won
    You, Younggap
    WMSCI 2010: 14TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, 2010, : 34 - 38
  • [3] The Ethics of Biomedical ‘Big Data’ Analytics
    Brent Mittelstadt
    Philosophy & Technology, 2019, 32 (1) : 17 - 21
  • [4] A Distributed Elastic Net Regression Algorithm for Private Data Analytics in Internet of Things
    Fang W.
    Liu M.
    Wang Y.
    Li Y.
    An Z.
    Fang, Weiwei (wwfang@bjtu.edu.cn), 1600, Science Press (42): : 2403 - 2411
  • [5] A Distributed Elastic Net Regression Algorithm for Private Data Analytics in Internet of Things
    Fang Weiwei
    Liu Mengran
    Wang Yunpeng
    Li Yangyang
    An Zhulin
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (10) : 2403 - 2411
  • [6] Approximate Computation for Big Data Analytics
    Ma, Shuai
    DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : XVIII - XVIII
  • [7] Editorial: Sensing and Data Analytics
    Raphael, Benny
    Thomas, Albert
    Louis, Joseph
    FRONTIERS IN BUILT ENVIRONMENT, 2020, 6
  • [8] The age of data analytics: converting biomedical data into actionable insights
    Veselkov, Kirill
    Schuller, Bjoern
    METHODS, 2018, 151 : 1 - 2
  • [9] Run Data Run! Re-distributing Data via Piggybacking for Geo-distributed Data Analytics
    Li, Yefei
    Jin, Yibo
    Chen, Haiyang
    Xi, Wenchao
    Ji, Mingtao
    Zhang, Sheng
    Qian, Zhuzhong
    Lu, Sanglu
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 356 - 363
  • [10] The Data Swarm: A Next Step for Distributed Data Analytics
    Smith, Jeffrey
    Rege, Manjeet
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2016, 6 (01) : 52 - 64