Predicting dropout from psychological treatment using different machine learning algorithms, resampling methods, and sample sizes

被引:9
|
作者
Giesemann, Julia [1 ,3 ]
Delgadillo, Jaime [2 ]
Schwartz, Brian [1 ]
Bennemann, Bjoern [1 ]
Lutz, Wolfgang [1 ]
机构
[1] Univ Trier, Dept Psychol, Clin Psychol & Psychotherapy, Trier, Germany
[2] Univ Sheffield, Dept Psychol, Clin & Applied Psychol Unit, Sheffield, England
[3] Wissenschaftspark 25 27, D-54296 Trier, Germany
关键词
dropout prediction; machine learning; supervised learning; sample size; resampling methods; data imbalance; COGNITIVE-BEHAVIORAL THERAPY; PREMATURE TERMINATION; ANXIETY DISORDERS; PSYCHOTHERAPY; CLASSIFICATION; DEPRESSION; PATIENT; MODELS; PERFORMANCE; PHARMACOTHERAPY;
D O I
10.1080/10503307.2022.2161432
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Objective:The occurrence of dropout from psychological interventions is associated with poor treatment outcome and high health, societal and economic costs. Recently, machine learning (ML) algorithms have been tested in psychotherapy outcome research. Dropout predictions are usually limited by imbalanced datasets and the size of the sample. This paper aims to improve dropout prediction by comparing ML algorithms, sample sizes and resampling methods.Method:Twenty ML algorithms were examined in twelve subsamples (drawn from a sample of N = 49,602) using four resampling methods in comparison to the absence of resampling and to each other. Prediction accuracy was evaluated in an independent holdout dataset using the F-1-Measure.Results:Resampling methods improved the performance of ML algorithms and down-sampling can be recommended, as it was the fastest method and as accurate as the other methods. For the highest mean F-1-Score of .51 a minimum sample size of N = 300 was necessary. No specific algorithm or algorithm group can be recommended.Conclusion:Resampling methods could improve the accuracy of predicting dropout in psychological interventions. Down-sampling is recommended as it is the least computationally taxing method. The training sample should contain at least 300 cases.
引用
收藏
页码:683 / 695
页数:13
相关论文
共 50 条
  • [41] A Model for Predicting Cervical Cancer Using Machine Learning Algorithms
    Al Mudawi, Naif
    Alazeb, Abdulwahab
    SENSORS, 2022, 22 (11)
  • [42] Predicting Fitness and Performance of Diving using Machine Learning Algorithms
    Mahajan, Uma
    Krishnan, Anup
    Malhotra, Vineet
    Sharma, Deep
    Gore, Sharad
    2019 IEEE PUNE SECTION INTERNATIONAL CONFERENCE (PUNECON), 2019,
  • [43] Predicting the pharmaceutical needs of hospitals using machine learning algorithms
    Nabizadeh, Amir Hossein
    Ghaemi, Mohammad Mehdi
    Goncalves, Daniel
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [44] Predicting coronary artery calcification using machine learning algorithms
    sun, Y. V.
    Bielak, L. F.
    Pevser, P. A.
    Turner, S. T.
    Sheed, P. F., II
    Boerwinkle, E.
    Kardial, S. L. R.
    GENETIC EPIDEMIOLOGY, 2007, 31 (05) : 499 - 499
  • [45] Predicting the Start of Protein α-Helices Using Machine Learning Algorithms
    Camacho, Rui
    Ferreira, Rita
    Rosa, Natacha
    Guimaraes, Vania
    Fonseca, Nuno A.
    Costa, Vitor Santos
    de Sousa, Miguel
    Magalhaes, Alexandre
    ADVANCES IN BIOINFORMATICS, 2010, 74 : 33 - +
  • [46] Predicting the secondary structure of proteins using Machine Learning algorithms
    Camacho, Rui
    Ferreira, Rita
    Rosa, Natacha
    Guimaraes, Vania
    Fonseca, Nuno A.
    Costa, Vitor Santos
    de Sousa, Miguel
    Magalhaes, Alexandre
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2012, 6 (06) : 571 - 584
  • [47] Predicting Individual Thermal Comfort using Machine Learning Algorithms
    Farhan, Asma Ahmad
    Pattipati, Krishna
    Wang, Bing
    Luh, Peter
    2015 INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2015, : 708 - 713
  • [48] Predicting Cervical Cancer using Advanced Machine Learning Algorithms
    Vaishnodevi, S.
    Devarajan, N. Manikanda
    Murali, G.
    Kumar, D. Vinod
    Madhuvappan, C. Arunkumar
    Siva, C.
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1600 - 1604
  • [49] Predicting cash holdings using supervised machine learning algorithms
    Ozlem, Sirin
    Tan, Omer Faruk
    FINANCIAL INNOVATION, 2022, 8 (01)
  • [50] Predicting bid prices by using machine learning methods
    Kim, Jong-Min
    Jung, Hojin
    APPLIED ECONOMICS, 2019, 51 (19) : 2011 - 2018