Private Data Analytics on Biomedical Sensing Data via Distributed Computation

被引:30
|
作者
Gong, Yanmin [1 ]
Fang, Yuguang [1 ]
Guo, Yuanxiong [2 ]
机构
[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
[2] Oklahoma State Univ, Sch Elect & Comp Engn, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
Private data analytics; mobile health; predictive model training; logistic regression; LOGISTIC-REGRESSION; ANONYMIZATION; CARE;
D O I
10.1109/TCBB.2016.2515610
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Advances in biomedical sensors and mobile communication technologies have fostered the rapid growth of mobile health (mHealth) applications in the past years. Users generate a high volume of biomedical data during health monitoring, which can be used by the mHealth server for training predictive models for disease diagnosis and treatment. However, the biomedical sensing data raise serious privacy concerns because they reveal sensitive information such as health status and lifestyles of the sensed subjects. This paper proposes and experimentally studies a scheme that keeps the training samples private while enabling accurate construction of predictive models. We specifically consider logistic regression models which are widely used for predicting dichotomous outcomes in healthcare, and decompose the logistic regression problem into small subproblems over two types of distributed sensing data, i.e., horizontally partitioned data and vertically partitioned data. The subproblems are solved using individual private data, and thus mHealth users can keep their private data locally and only upload (encrypted) intermediate results to the mHealth server for model training. Experimental results based on real datasets show that our scheme is highly efficient and scalable to a large number of mHealth users.
引用
收藏
页码:431 / 444
页数:14
相关论文
共 50 条
  • [31] Data Distribution and Scheduling for Distributed Analytics Tasks
    Pasteris, Stephen
    Wang, Shiqiang
    Makaya, Christian
    Chan, Kevin
    Herbster, Mark
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [32] Distributed algorithm for big data analytics in healthcare
    Forestiero, Agostino
    Papuzzo, Giuseppe
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 776 - 779
  • [33] Distributed Big Data Analytics in Service Computing
    Yu, Weider D.
    Gottumukkala, AvinashChander
    Senthailselvi, Deenash Arivazhagan
    Maniraj, Prabhu
    Khonde, Tushar
    2017 IEEE 13TH INTERNATIONAL SYMPOSIUM ON AUTONOMOUS DECENTRALIZED SYSTEMS (ISADS 2017), 2017, : 55 - 60
  • [34] Data Analytics Algorithm Benchmark on Distributed Systems
    Hamid, Mohd Hakim Abdul
    Abu, Nur Azman
    Mohamad, Siti Nurul Mahfuzah
    Idris, Ariff
    Zakaria, Zahriladha
    Sulaiman, Zuraidah
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND TECHNOLOGY (ICAST'18), 2018, 2016
  • [35] Distributed Data Analytics Framework for Smart Transportation
    Howard, Alexander J.
    Lee, Tim
    Mahar, Sara
    Intrevado, Paul
    Myung-kyung, Diane
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1374 - 1380
  • [36] Visually Programming Dataflows for Distributed Data Analytics
    Thamsen, Lauritz
    Renner, Thomas
    Byfeld, Marvin
    Paeschke, Markus
    Schroeder, Daniel
    Boehm, Felix
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2285 - 2294
  • [37] Differentially Private Distributed Data Analysis
    Takabi, Hassan
    Koppikar, Samir
    Zargar, Saman Taghavi
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (IEEE CIC), 2016, : 212 - 218
  • [38] Pangea: Monolithic Distributed Storage for Data Analytics
    Zou, Jia
    Iyengar, Arun
    Jermaine, Chris
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (06): : 681 - 694
  • [39] Distributed data networks: a blueprint for Big Data sharing and healthcare analytics
    Popovic, Jennifer R.
    ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 2017, 1387 (01) : 105 - 111
  • [40] Stereo data sensing, computation and perception
    Lu, Ke
    Yang, You
    Zhen, Yi
    NEUROCOMPUTING, 2016, 215 : 1 - 2