Predicting dropout from psychological treatment using different machine learning algorithms, resampling methods, and sample sizes

Cited by: 9
Authors
Giesemann, Julia [1, 3]
Delgadillo, Jaime [2]
Schwartz, Brian [1]
Bennemann, Bjoern [1]
Lutz, Wolfgang [1]
Affiliations
[1] Univ Trier, Dept Psychol, Clin Psychol & Psychotherapy, Trier, Germany
[2] Univ Sheffield, Dept Psychol, Clin & Applied Psychol Unit, Sheffield, England
[3] Wissenschaftspark 25 27, D-54296 Trier, Germany
Keywords
dropout prediction; machine learning; supervised learning; sample size; resampling methods; data imbalance; COGNITIVE-BEHAVIORAL THERAPY; PREMATURE TERMINATION; ANXIETY DISORDERS; PSYCHOTHERAPY; CLASSIFICATION; DEPRESSION; PATIENT; MODELS; PERFORMANCE; PHARMACOTHERAPY
DOI
10.1080/10503307.2022.2161432
Chinese Library Classification
B849 [Applied Psychology]
Discipline classification code
040203
Abstract
Objective: Dropout from psychological interventions is associated with poor treatment outcome and high health, societal and economic costs. Recently, machine learning (ML) algorithms have been tested in psychotherapy outcome research. Dropout predictions are usually limited by imbalanced datasets and by sample size. This paper aims to improve dropout prediction by comparing ML algorithms, sample sizes and resampling methods.
Method: Twenty ML algorithms were examined in twelve subsamples (drawn from a sample of N = 49,602) using four resampling methods, which were compared with each other and with the absence of resampling. Prediction accuracy was evaluated in an independent holdout dataset using the F1-measure.
Results: Resampling methods improved the performance of ML algorithms. Down-sampling can be recommended, as it was the fastest method and as accurate as the other methods. A minimum sample size of N = 300 was necessary to reach the highest mean F1-score of .51. No specific algorithm or algorithm group can be recommended.
Conclusion: Resampling methods could improve the accuracy of predicting dropout in psychological interventions. Down-sampling is recommended as it is the least computationally taxing method. The training sample should contain at least 300 cases.
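The two technical ingredients named in the abstract, down-sampling and the F1-measure, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names `down_sample` and `f1_score`, the 0/1 label coding (1 = dropout) and the toy data are all assumptions made for illustration. Down-sampling here means randomly discarding majority-class cases until both classes are equally frequent, and F1 is the harmonic mean of precision P and recall R, F1 = 2PR / (P + R).

```python
import random


def down_sample(rows, labels, seed=0):
    """Randomly drop majority-class cases until both classes are the same size.

    Assumes binary labels coded 0/1 (hypothetical coding: 1 = dropout).
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    # Keep every minority case plus an equally sized random draw from the majority.
    kept = sorted(minority + rng.sample(majority, len(minority)))
    return [rows[i] for i in kept], [labels[i] for i in kept]


def f1_score(y_true, y_pred):
    """F1 for the positive class (label 1): harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy, made-up data: 3 dropouts among 10 cases.
toy_rows = list(range(10))
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
balanced_rows, balanced_labels = down_sample(toy_rows, toy_labels)

# Toy predictions on a separate, made-up holdout set.
holdout_true = [1, 1, 1, 0, 0, 0]
holdout_pred = [1, 1, 0, 1, 0, 0]
holdout_f1 = f1_score(holdout_true, holdout_pred)
```

In such a setup only the training data would be down-sampled; the holdout set keeps its original class balance so that the F1-score reflects the real dropout rate.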
Pages: 683-695
Number of pages: 13