An integrated data- and theory-driven crash severity model

被引:2
|
作者
Liu, Dongjie [1 ]
Li, Dawei [1 ,2 ,3 ]
Sze, N. N. [4 ]
Ding, Hongliang [4 ,5 ]
Song, Yuchen [1 ]
机构
[1] Southeast Univ, Sch Transportat, Nanjing 211189, Jiangsu, Peoples R China
[2] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 211189, Jiangsu, Peoples R China
[3] Jiangsu Prov Collaborat Innovat Ctr Modern Urban T, Nanjing 211189, Jiangsu, Peoples R China
[4] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[5] Southwest Jiaotong Univ, Inst Smart City & Intelligent Transportta, Inst Urban Rail Transportat, Chengdu 611756, Sichuan, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Logit model; Crash severity; Embedding representations; Data; and theory-driven model; Interpretable machine learning; DISCRETE-CHOICE MODELS; DEEP NEURAL-NETWORKS; INJURY SEVERITY; MULTINOMIAL LOGIT; ACCIDENT SEVERITY; ROLLOVER CRASHES; VEHICLE; REPRESENTATION; HIGHWAYS;
D O I
10.1016/j.aap.2023.107282
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
For crash severity modeling, researchers typically view theory-driven models and data-driven models as different or even conflicting approaches. The reason is that the machine-learning models offer good predictability but weak interpretability, while the latter has robust interpretability but moderate predictability. In order to alleviate the tension between them, this study proposes an integrated data- and theory-driven crash-severity model, known as Embedded Fusion model based on Text Vector Representations (TVR-EF), by leveraging the complementary strengths of both. The model specification consists of two parts. (i) the data-driven component not only mitigate the deficiencies of traditional econometric models, where one-hot encoding is frequently used and makes it impossible to observe semantic relatedness between variable categories, but also enhances the interpretability for the relationship between crash severity and potential influencing factors using the learned embedding weight matrix. (ii) In the theory-driven component, the multinomial logit model is implemented as a 2D-Convolutional Neural Network (2D-CNN) to increase flexibility and decrease dependency on prior knowledge for different crash-severity outcomes. A crash dataset from Guangdong Province, China, is utilized to estimate the TVR-EF model, which is then benchmarked against two traditional econometric models and three widely used machine-learning models. Results indicate that TVR-EF model does not only improve the predictive performance but also makes it easier to interpret.
引用
收藏
页数:20
相关论文
共 50 条
  • [11] Fake News Early Detection: A Theory-driven Model
    Zhou, Xinyi
    Jain, Atishay
    Phoha, Vir V.
    Zafarani, Reza
    DIGITAL THREATS: RESEARCH AND PRACTICE, 2020, 1 (02):
  • [12] Why Does Data-Driven Beat Theory-Driven Computer Vision?
    Tsotsos, John
    Kotseruba, Iuliia
    Andreopoulos, Alexander
    Wu, Yulong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2057 - 2060
  • [14] Practical Program Evaluation: Theory-Driven Evaluation and the Integrated Evaluation Perspective
    Schmidt, Uwe
    ZEITSCHRIFT FUR EVALUATION, 2016, 15 (01): : 161 - 163
  • [15] ISSUES IN THE THEORY-DRIVEN PERSPECTIVE
    CHEN, HT
    ROSSI, PH
    EVALUATION AND PROGRAM PLANNING, 1989, 12 (04) : 299 - 306
  • [16] Theory-driven choice models
    Erdem, T
    Srinivasan, K
    Amaldoss, W
    Bajari, P
    Che, H
    Ho, T
    Hutchinson, W
    Katz, M
    Keane, M
    Meyer, R
    Reiss, P
    MARKETING LETTERS, 2005, 16 (3-4) : 225 - 237
  • [17] THE THEORY-DRIVEN APPROACH TO VALIDITY
    CHEN, HT
    ROSSI, PH
    EVALUATION AND PROGRAM PLANNING, 1987, 10 (01) : 95 - 103
  • [18] Theory-Driven Choice Models
    Tülin Erdem
    Kannan Srinivasan
    Wilfred Amaldoss
    Patrick Bajari
    Hai Che
    Teck Ho
    Wes Hutchinson
    Michael Katz
    Michael Keane
    Robert Meyer
    Peter Reiss
    Marketing Letters, 2005, 16 : 225 - 237
  • [19] Theory-Driven Evaluation of a Multisite Nursing Professional Practice Model
    Gentile, Deborah
    Marzinski, Sara J.
    JOURNAL OF NURSING ADMINISTRATION, 2020, 50 (7-8): : 419 - 425
  • [20] Toward a theory-driven model of acculturation in public health research
    Abraido-Lanza, Ana F.
    Armbrister, Adria N.
    Florez, Karen R.
    Aguirre, Alejandra N.
    AMERICAN JOURNAL OF PUBLIC HEALTH, 2006, 96 (08) : 1342 - 1346