Predicting Web Survey Breakoffs Using Machine Learning Models

被引：0

作者：

Chen, Zeming ^{[1
]}

Cernat, Alexandru ^{[2
]}

Shlomo, Natalie ^{[2
]}

机构：

[1] Univ Manchester, Social Stat Dept, Manchester, Lancs, England

[2] Univ Manchester, Social Stat Dept, Social Stat, Manchester, Lancs, England

来源：

SOCIAL SCIENCE COMPUTER REVIEW | 2023年 / 41卷 / 02期

关键词：

breakoff timing; time-varying variables; Cox model; LASSO Cox model; logistic regression; random forest; gradient boosting; support vector machine; RATES; TREE;

D O I：

10.1177/08944393221112000

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Web surveys are becoming increasingly popular but tend to have more breakoffs compared to the interviewer-administered surveys. Survey breakoffs occur when respondents quit the survey partway through. The Cox survival model is commonly used to understand patterns of breakoffs. Nevertheless, there is a trend to using more data-driven models when the purpose is prediction, such as classification machine learning models. It is unclear in the breakoff literature what are the best statistical models for predicting question-level breakoffs. Additionally, there is no consensus about the treatment of time-varying question-level predictors, such as question response time and question word count. While some researchers use the current values, others aggregate the value from the beginning of the survey. This study develops and compares both survival models and classification models along with different treatments of time-varying variables. Based on the level of agreement between the predicted and actual breakoff, we find that the Cox model and gradient boosting outperform other survival models and classification models respectively. We also find that using the values of time-varying predictors concurrent to the breakoff status is more predictive of breakoff, compared to aggregating their values from the beginning of the survey, implying that respondents' breakoff behaviour is more driven by the current response burden.

引用

页码：573 / 591

页数：19

共 50 条

[1] Predicting Trending Elements on Web Pages Using Machine Learning
Khalil, Naziha Shekh
Eraslan, Sukru
Yesilada, Yeliz
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024, 40 (22) : 7065 - 7080
[2] Predicting Asthma Exacerbations Using Machine Learning Models
Turcatel, Gianluca
Xiao, Yi
Caveney, Scott
Gnacadja, Gilles
Kim, Julie
Molfino, Nestor A.
ADVANCES IN THERAPY, 2025, 42 (01) : 362 - 374
[3] Predicting overall survival in chordoma patients using machine learning models: a web-app application
Cheng, Peng
Xie, Xudong
Knoedler, Samuel
Mi, Bobin
Liu, Guohui
JOURNAL OF ORTHOPAEDIC SURGERY AND RESEARCH, 2023, 18 (01)
[4] Predicting overall survival in chordoma patients using machine learning models: a web-app application
Peng Cheng
Xudong Xie
Samuel Knoedler
Bobin Mi
Guohui Liu
Journal of Orthopaedic Surgery and Research, 18
[5] The Impact of Textual Messages of Encouragement on Web Survey Breakoffs: An Experiment
Sakshaug, Joseph W.
Crawford, Scott D.
INTERNATIONAL JOURNAL OF INTERNET SCIENCE, 2009, 4 (01) : 50 - 60
[6] Impact of question topics and filter question formats on web survey breakoffs
Chen, Zeming
Cernat, Alexandru
Shlomo, Natalie
Eckman, Stephanie
INTERNATIONAL JOURNAL OF MARKET RESEARCH, 2022, 64 (06) : 710 - 726
[7] Predicting the Occurrence of Metabolic Syndrome Using Machine Learning Models
Trigka, Maria
Dritsas, Elias
Lahoz-Beltra, Rafael
Zhang, Yudong
COMPUTATION, 2023, 11 (09)
[8] PREDICTING HEALTHCARE COSTS OF DIABETES USING MACHINE LEARNING MODELS
Gonzalez Rodriguez, J.
Pinzon Espitia, O. L.
Franco, C.
Augusto, V
VALUE IN HEALTH, 2019, 22 : S575 - S575
[9] Predicting maternal risk level using machine learning models
Al Mashrafi, Sulaiman Salim
Tafakori, Laleh
Abdollahian, Mali
BMC PREGNANCY AND CHILDBIRTH, 2024, 24 (01)
[10] Predicting wildfires in Algerian forests using machine learning models
Zaidi, Abdelhamid
HELIYON, 2023, 9 (07)

← 1 2 3 4 5 →