A Secure and Reusable Software Architecture for Supporting Online Data Harmonization

被引:0
|
作者
Feric, Zlatan [1 ]
Agostini, Nicolas Bohm [1 ]
Beene, Daniel [4 ]
Signes-Pastor, Antonio J. [6 ]
Halchenko, Yuliya [6 ]
Watkins, Deborah [5 ]
MacKenzie, Debra [4 ]
Karagas, Margaret [6 ]
Manjourides, Justin [2 ]
Alshawabkeh, Akram [3 ]
Kaeli, David [1 ]
机构
[1] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
[2] Northeastern Univ, Bouve Coll Hlth Sci, Boston, MA 02115 USA
[3] Northeastern Univ, Dept Civil & Environm Engn, Boston, MA 02115 USA
[4] Univ New Mexico, Coll Pharm, Hlth Sci Ctr, Community Environm Hlth Program, Albuquerque, NM 87131 USA
[5] Univ Michigan, Sch Publ Hlth, Environm Hlth Sci, Ann Arbor, MI 48109 USA
[6] Geisel Sch Med Dartmouth, Dept Epidemiol, Hanover, NH USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2021年
关键词
CHILD HEALTH OUTCOMES; ENVIRONMENTAL-INFLUENCES; EXPOSURE; MINE;
D O I
10.1109/BigData52589.2021.9671538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Retrospective data harmonization across multiple research cohorts and studies is frequently done to increase statistical power, provide comparison analysis, and create a richer data source for data mining. However, when combining disparate data sources, harmonization projects face data management and analysis challenges. These include differences in the data dictionaries and variable definitions, privacy concerns surrounding health data representing sensitive populations, and lack of properly defined data models. With the availability of mature open-source web-based database technologies, developing a complete software architecture to overcome the challenges associated with the harmonization process can alleviate many roadblocks. By leveraging state-of-the-art software engineering and database principles, we can ensure data quality and enable cross-center online access and collaboration. This paper outlines a complete software architecture developed and customized using the Django web framework, leveraged to harmonize sensitive data collected from three NIH-support birth cohorts. We describe our framework and show how we successfully overcame challenges faced when harmonizing data from these cohorts. We discuss our efforts in data cleaning, data sharing, data transformation, data visualization, and analytics, while reflecting on what we have learned to date from these harmonized datasets.
引用
收藏
页码:2801 / 2812
页数:12
相关论文
共 50 条
  • [21] Building and assembling reusable components based on software architecture
    Zhang, Shi-Kun
    Zhang, Wen-Juan
    Chang, Xin
    Wang, Li-Fu
    Yang, Fu-Qing
    Ruan Jian Xue Bao/Journal of Software, 2001, 12 (09): : 1351 - 1359
  • [22] Technology harmonization - Developing a reference architecture for the ground segment software
    Duro, N.
    Moreira, F.
    Rogado, J.
    Reis, J.
    Peccia, Nestor
    2005 IEEE Aerospace Conference, Vols 1-4, 2005, : 3968 - 3979
  • [23] OpenArgue: Supporting Argumentation to Evolve Secure Software Systems
    Yu, Yijun
    Thein Than Tun
    Tedeschi, Alessandra
    Franqueira, Virginia N. L.
    Nuseibeh, Bashar
    2011 19TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE), 2011, : 351 - +
  • [24] Adaptive secure software architecture for electronic commerce
    Tak, SW
    Park, EK
    SOFTWARE-PRACTICE & EXPERIENCE, 2003, 33 (14): : 1343 - 1357
  • [25] Trusting software tools in a secure DRM architecture
    Torres, Victor
    Delgado, Jaime
    Llorente, Silvia
    AXMEDIS 2007: THIRD INTERNATIONAL CONFERENCE ON AUTOMATED PRODUCTION OF CROSS MEDIA CONTENT FOR MULTI-CHANNEL DISTRIBUTION, PROCEEDINGS, 2007, : 55 - +
  • [26] Development of a secure communication architecture for software agents
    Saenchai, K
    Benedicenti, L
    WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVII, PROCEEDINGS: CYBERNETICS AND INFORMATICS: CONCEPTS AND APPLICATIONS (PT II), 2001, : 528 - 530
  • [27] Supporting Software Architecture Evolution by Functional Decomposition
    Faitelson, David
    Heinrich, Robert
    Tyszberowicz, Shmuel
    MODELSWARD: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, 2017, : 435 - 442
  • [28] Towards supporting the software architecture life cycle
    Weinreich, Rainer
    Buchgeher, Georg
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (03) : 546 - 561
  • [29] Software bus architecture of CAD supporting system
    Lu, W
    Zheng, GQ
    Sun, JG
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN & COMPUTER GRAPHICS, 1999, : 989 - 993
  • [30] A SOFTWARE ARCHITECTURE FOR SUPPORTING THE EXCHANGE OF ELECTRONIC MANUSCRIPTS
    MAMRAK, SA
    KAELBLING, MJ
    NICHOLAS, CK
    SHARE, M
    COMMUNICATIONS OF THE ACM, 1987, 30 (05) : 408 - 414