Predicting Web Survey Breakoffs Using Machine Learning Models

Authors
Chen, Zeming [1]
Cernat, Alexandru [2]
Shlomo, Natalie [2]
Affiliations
[1] Univ Manchester, Social Stat Dept, Manchester, Lancs, England
[2] Univ Manchester, Social Stat Dept, Social Stat, Manchester, Lancs, England
Keywords
breakoff timing; time-varying variables; Cox model; LASSO Cox model; logistic regression; random forest; gradient boosting; support vector machine; RATES; TREE;
DOI
10.1177/08944393221112000
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Web surveys are becoming increasingly popular but tend to have more breakoffs than interviewer-administered surveys. Survey breakoffs occur when respondents quit the survey partway through. The Cox survival model is commonly used to understand patterns of breakoffs. Nevertheless, there is a trend towards more data-driven models, such as classification machine learning models, when the purpose is prediction. It is unclear in the breakoff literature which statistical models are best for predicting question-level breakoffs. Additionally, there is no consensus about the treatment of time-varying question-level predictors, such as question response time and question word count. While some researchers use the current values, others aggregate the values from the beginning of the survey. This study develops and compares both survival models and classification models along with different treatments of time-varying variables. Based on the level of agreement between the predicted and actual breakoff, we find that the Cox model and gradient boosting outperform the other survival models and classification models, respectively. We also find that using the values of time-varying predictors concurrent with the breakoff status is more predictive of breakoff than aggregating their values from the beginning of the survey, implying that respondents' breakoff behaviour is driven more by the current response burden.
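The two treatments of a time-varying predictor mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the response times, the function names, the column names, and the choice of a running sum as the aggregation are all assumptions made for illustration.

```python
from itertools import accumulate

# Hypothetical per-question response times (seconds) for one respondent,
# in the order the questions were answered. Illustrative values only.
response_times = [12.0, 8.5, 30.0, 4.5]


def concurrent_values(values):
    """Treatment 1: use the value observed at the current question."""
    return list(values)


def cumulative_values(values):
    """Treatment 2: aggregate from the start of the survey.

    A running sum is assumed here; the abstract does not say which
    aggregation function is used.
    """
    return list(accumulate(values))


def to_question_level_rows(values, broke_off_at=None):
    """Build one row per question reached, holding both predictor versions.

    broke_off_at is the 0-based index of the question at which the
    respondent quit, or None if the survey was completed.
    """
    current = concurrent_values(values)
    cumulative = cumulative_values(values)
    rows = []
    for i, (cur, agg) in enumerate(zip(current, cumulative)):
        rows.append({
            "question": i + 1,
            "time_current": cur,
            "time_cumulative": agg,
            "breakoff": int(broke_off_at == i),
        })
    return rows


# Respondent quits at the fourth question (index 3).
rows = to_question_level_rows(response_times, broke_off_at=3)
for row in rows:
    print(row)
```

Each row pairs the question-level breakoff indicator with either `time_current` or `time_cumulative`, so the same question-level dataset could feed a survival or classification model under either treatment.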
Pages: 573-591
Number of pages: 19