Predicting Web Survey Breakoffs Using Machine Learning Models

被引:0
|
作者
Chen, Zeming [1 ]
Cernat, Alexandru [2 ]
Shlomo, Natalie [2 ]
机构
[1] Univ Manchester, Social Stat Dept, Manchester, Lancs, England
[2] Univ Manchester, Social Stat Dept, Social Stat, Manchester, Lancs, England
关键词
breakoff timing; time-varying variables; Cox model; LASSO Cox model; logistic regression; random forest; gradient boosting; support vector machine; RATES; TREE;
D O I
10.1177/08944393221112000
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Web surveys are becoming increasingly popular but tend to have more breakoffs compared to the interviewer-administered surveys. Survey breakoffs occur when respondents quit the survey partway through. The Cox survival model is commonly used to understand patterns of breakoffs. Nevertheless, there is a trend to using more data-driven models when the purpose is prediction, such as classification machine learning models. It is unclear in the breakoff literature what are the best statistical models for predicting question-level breakoffs. Additionally, there is no consensus about the treatment of time-varying question-level predictors, such as question response time and question word count. While some researchers use the current values, others aggregate the value from the beginning of the survey. This study develops and compares both survival models and classification models along with different treatments of time-varying variables. Based on the level of agreement between the predicted and actual breakoff, we find that the Cox model and gradient boosting outperform other survival models and classification models respectively. We also find that using the values of time-varying predictors concurrent to the breakoff status is more predictive of breakoff, compared to aggregating their values from the beginning of the survey, implying that respondents' breakoff behaviour is more driven by the current response burden.
引用
收藏
页码:573 / 591
页数:19
相关论文
共 50 条
  • [11] Predicting brain tumor presence using machine learning models
    Huang, Weiguo
    Dai, Zhenhua
    MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2025, 8 (01)
  • [12] Comparison of Predicting Regional Mortalities Using Machine Learning Models
    Caglar, Oguzhan
    Ozen, Figen
    ARTIFICIAL INTELLIGENCE FOR INTERNET OF THINGS (IOT) AND HEALTH SYSTEMS OPERABILITY, IOTHIC 2023, 2024, 8 : 59 - 72
  • [13] Predicting Promoters in Phage Genomes Using Machine Learning Models
    Sampaio, Marta
    Rocha, Miguel
    Oliveira, Hugo
    Dias, Oscar
    PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 1005 : 105 - 112
  • [14] Predicting Web Vulnerabilities in Web Applications Based on Machine Learning
    Khalid, Muhammad Noman
    Farooq, Humera
    Iqbal, Muhammad
    Alam, Muhammad Talha
    Rasheed, Kamran
    INTELLIGENT TECHNOLOGIES AND APPLICATIONS, INTAP 2018, 2019, 932 : 473 - 484
  • [15] Learning and Predicting Key Web Navigation Patterns Using Bayesian Models
    Hassan, Malik Tahir
    Junejo, Khurum Nazir
    Karim, Asim
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2009, PT II, 2009, 5593 : 877 - 887
  • [16] A Survey for Predicting Enzyme Family Classes Using Machine Learning Methods
    Tan, Jiu-Xin
    Lv, Hao
    Wang, Fang
    Dao, Fu-Ying
    Chen, Wei
    Ding, Hui
    CURRENT DRUG TARGETS, 2019, 20 (05) : 540 - 550
  • [17] Predicting Market Impact Costs Using Nonparametric Machine Learning Models
    Park, Saerom
    Lee, Jaewook
    Son, Youngdoo
    PLOS ONE, 2016, 11 (02):
  • [18] Predicting for disease resistance in aquaculture species using machine learning models
    Palaiokostas, Christos
    AQUACULTURE REPORTS, 2021, 20
  • [19] Predicting Body Composition In The US Population Using Machine Learning Models
    Xu, Huaijin
    Situ, Jason
    Hou, Ruibo
    Li, Mingxi
    Gao, Xiaotian
    MEDICINE & SCIENCE IN SPORTS & EXERCISE, 2024, 56 (10) : 479 - 479
  • [20] Predicting the thermodynamic stability of perovskite oxides using machine learning models
    Li, Wei
    Jacobs, Ryan
    Morgan, Dane
    COMPUTATIONAL MATERIALS SCIENCE, 2018, 150 : 454 - 463