An integrated data- and theory-driven crash severity model

被引:2
|
作者
Liu, Dongjie [1 ]
Li, Dawei [1 ,2 ,3 ]
Sze, N. N. [4 ]
Ding, Hongliang [4 ,5 ]
Song, Yuchen [1 ]
机构
[1] Southeast Univ, Sch Transportat, Nanjing 211189, Jiangsu, Peoples R China
[2] Southeast Univ, Jiangsu Key Lab Urban ITS, Nanjing 211189, Jiangsu, Peoples R China
[3] Jiangsu Prov Collaborat Innovat Ctr Modern Urban T, Nanjing 211189, Jiangsu, Peoples R China
[4] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[5] Southwest Jiaotong Univ, Inst Smart City & Intelligent Transportta, Inst Urban Rail Transportat, Chengdu 611756, Sichuan, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Logit model; Crash severity; Embedding representations; Data; and theory-driven model; Interpretable machine learning; DISCRETE-CHOICE MODELS; DEEP NEURAL-NETWORKS; INJURY SEVERITY; MULTINOMIAL LOGIT; ACCIDENT SEVERITY; ROLLOVER CRASHES; VEHICLE; REPRESENTATION; HIGHWAYS;
D O I
10.1016/j.aap.2023.107282
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
For crash severity modeling, researchers typically view theory-driven models and data-driven models as different or even conflicting approaches. The reason is that the machine-learning models offer good predictability but weak interpretability, while the latter has robust interpretability but moderate predictability. In order to alleviate the tension between them, this study proposes an integrated data- and theory-driven crash-severity model, known as Embedded Fusion model based on Text Vector Representations (TVR-EF), by leveraging the complementary strengths of both. The model specification consists of two parts. (i) the data-driven component not only mitigate the deficiencies of traditional econometric models, where one-hot encoding is frequently used and makes it impossible to observe semantic relatedness between variable categories, but also enhances the interpretability for the relationship between crash severity and potential influencing factors using the learned embedding weight matrix. (ii) In the theory-driven component, the multinomial logit model is implemented as a 2D-Convolutional Neural Network (2D-CNN) to increase flexibility and decrease dependency on prior knowledge for different crash-severity outcomes. A crash dataset from Guangdong Province, China, is utilized to estimate the TVR-EF model, which is then benchmarked against two traditional econometric models and three widely used machine-learning models. Results indicate that TVR-EF model does not only improve the predictive performance but also makes it easier to interpret.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Fairness Theory-Driven Incentive Model for Prefabricated Building Development
    Xiaojuan Li
    Chen Wang
    Mukhtar A. Kassem
    Samuel Bimenyimana
    Arabian Journal for Science and Engineering, 2022, 47 : 13487 - 13498
  • [22] Towards an evidence base of theory-driven evaluations: Some questions for proponents of theory-driven evaluation
    Sridharan, Sanjeev
    Nakaima, April
    EVALUATION, 2012, 18 (03) : 378 - 395
  • [23] Are Theory-Driven Behavior Change Interventions Truly Theory Driven?
    Conn, Vicki S.
    WESTERN JOURNAL OF NURSING RESEARCH, 2009, 31 (03) : 287 - 288
  • [24] Fairness Theory-Driven Incentive Model for Prefabricated Building Development
    Li, Xiaojuan
    Wang, Chen
    Kassem, Mukhtar A.
    Bimenyimana, Samuel
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (10) : 13487 - 13498
  • [25] A theory-driven evaluation of Integrated Health and Social Care Programmes in the Apulia Region
    Ferrara, Lucia
    Moro, Giuseppe
    INTERNATIONAL JOURNAL OF INTEGRATED CARE, 2016, 16
  • [26] Theory-driven or process-driven prediction? Epistemological challenges of big data analytics
    Elragal A.
    Klischewski R.
    Elragal, Ahmed (ahmed.elragal@ltu.se), 2017, SpringerOpen (04)
  • [27] IMPLEMENTATION THEORY AND THE THEORY-DRIVEN APPROACH TO VALIDITY
    PALUMBO, DJ
    OLIVERIO, A
    EVALUATION AND PROGRAM PLANNING, 1989, 12 (04) : 337 - 344
  • [28] Analyzing Social Exchange Motives With Theory-Driven Data and Machine Learning
    Igwe, Kevin
    Durrheim, Kevin
    IEEE ACCESS, 2024, 12 : 2135 - 2149
  • [29] A Data- Driven Approach for Stylolite Detection
    Cheng, Jingru
    He, Bohao
    Horne, Roland
    SPE JOURNAL, 2025, 30 (01): : 1 - 12
  • [30] Data- and model-driven selection using parallel line groups
    SyedaMahmood, TF
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1997, 67 (03) : 205 - 222