Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry

被引:13
|
作者
Cao, Han [1 ]
Meyer-Lindenberg, Andreas [1 ]
Schwarz, Emanuel [1 ]
机构
[1] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Psychiat & Psychotherapy, D-68159 Mannheim, Germany
关键词
multi-task learning; machine learning; biomarker discovery; psychiatry; GENE-EXPRESSION; MEGA-ANALYSIS; SCHIZOPHRENIA; PROFILES; DISEASES; FUTURE;
D O I
10.3390/ijms19113387
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Machine Learning Challenges in Big Data Era
    Veganzones-Bodon, Miguel
    DYNA, 2019, 94 (05): : 478 - 479
  • [32] Machine learning in 'big data': handle with care
    Loring, Zak
    Mehrotra, Suchit
    Piccini, Jonathan P.
    EUROPACE, 2019, 21 (09): : 1284 - 1285
  • [33] Machine Learning and Computational Intelligence in Big Data
    Anagnostopoulos, Christos
    Kolomvatsos, Kostas
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 873 - 874
  • [34] Machine learning on big data: Opportunities and challenges
    Zhou, Lina
    Pan, Shimei
    Wang, Jianwu
    Vasilakos, Athanasios V.
    NEUROCOMPUTING, 2017, 237 : 350 - 361
  • [35] Big Data and Machine Learning Framework in Healthcare
    Dogaru, Delia Ioana
    Dumitrache, Ioan
    2019 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2019,
  • [36] PivotalR: A Package for Machine Learning on Big Data
    Qian, Hai
    R JOURNAL, 2014, 6 (01): : 57 - 67
  • [37] Green Computing for Big Data and Machine Learning
    Barua, Hrishav Bakul
    Mondal, Kartick Chandra
    Khatua, Sunirmal
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 348 - 351
  • [38] Machine learning for Big Data analytics in plants
    Ma, Chuang
    Zhang, Hao Helen
    Wang, Xiangfeng
    TRENDS IN PLANT SCIENCE, 2014, 19 (12) : 798 - 808
  • [39] Big data algorithms beyond machine learning
    Mnich M.
    KI - Kunstliche Intelligenz, 2018, 32 (01): : 9 - 17
  • [40] Big data and machine learning for materials science
    Rodrigues J.F., Jr.
    Florea L.
    de Oliveira M.C.F.
    Diamond D.
    Oliveira O.N., Jr.
    Discover Materials, 1 (1):