Linguistic and Statistically Derived Features for Cause of Death Prediction from Verbal Autopsy Text

Authors
Danso, Samuel [1, 2]
Atwell, Eric [1, 2]
Johnson, Owen [2 ]
Affiliations
[1] Univ Leeds, Language Res Grp, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
[2] Univ Leeds, Yorkshire Ctr Hlth Informat, EHlth Res Grp, Leeds LS2 9JT, W Yorkshire, England
Keywords
Verbal Autopsy; Cause of Death Prediction; Features; Text Classification; CLASSIFICATION; VALIDATION;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic Text Classification (ATC) is an emerging technology with economic importance given the unprecedented growth of text data. This paper reports on work in progress to develop methods for predicting Cause of Death from Verbal Autopsy (VA) documents, which the World Health Organisation recommends for use in low-income countries. VA documents contain both coded data and open narrative. We formulate the task as a Text Classification problem and explore various combinations of linguistic and statistical approaches to determine how they may improve on the standard bag-of-words approach, using a dataset of over 6400 VA documents that were manually annotated with cause of death. We demonstrate that a significant improvement in prediction accuracy can be obtained through a novel combination of statistical and linguistic features derived from the VA text. The paper explores the methods by which ATC may lead to improved accuracy in Cause of Death prediction.
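The abstract contrasts a standard bag-of-words representation with richer feature sets. The sketch below illustrates that contrast on a made-up narrative sentence. It is a minimal illustration only: the function names, the bigram features, and the document-level statistics are assumptions chosen for the example, not the features or code used in the paper.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(text):
    """Baseline representation: raw counts of each word token."""
    return Counter(tokenize(text))


def combined_features(text):
    """Bag-of-words counts plus illustrative extra features.

    The extra features (word bigrams, token count, type/token ratio)
    are assumptions for illustration, not the paper's feature set.
    """
    tokens = tokenize(text)
    features = Counter(tokens)
    # Word bigrams as a crude stand-in for local linguistic context.
    features.update(f"{a}_{b}" for a, b in zip(tokens, tokens[1:]))
    # Simple document-level statistics.
    features["__n_tokens__"] = len(tokens)
    features["__type_token_ratio__"] = (
        len(set(tokens)) / len(tokens) if tokens else 0.0
    )
    return dict(features)


# Hypothetical narrative text, invented for this example.
narrative = "The child had fever and cough. The fever lasted three days."
print(bag_of_words(narrative)["fever"])
print(combined_features(narrative)["had_fever"])
```

Either feature dictionary could then be passed to a classifier of choice; the point of the sketch is only that the combined representation keeps every bag-of-words count and adds further signals on top of it.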
Pages: 47-60
Number of pages: 14