A Deep Learning-Based Multimodal Architecture to predict Signs of Dementia

Cited by: 4
Authors
Ortiz-Perez, David [1]
Ruiz-Ponce, Pablo [1]
Tomas, David [2]
Garcia-Rodriguez, Jose [1]
Vizcaya-Moreno, M. Flores [3]
Leo, Marco [4]
Affiliations
[1] Univ Alicante, Dept Comp Sci & Technol, Carretera San Vicente Raspeig, Alicante 03690, Spain
[2] Univ Alicante, Dept Software & Comp Syst, Carretera San Vicente Raspeig, Alicante 03690, Spain
[3] Univ Alicante, Fac Hlth Sci, Unit Clin Nursing Res, Carretera San Vicente Raspeig, Alicante 03690, Spain
[4] Natl Res Council Italy, Inst Appl Sci & Intelligent Syst, I-73100 Lecce, Italy
Keywords
Multimodal; Deep learning; Transformers; Dementia prediction
DOI
10.1016/j.neucom.2023.126413
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a multimodal deep learning architecture that combines text and audio information to predict dementia, a disease that affects around 55 million people worldwide and, in some cases, leaves them dependent on others. The system was evaluated on the DementiaBank Pitt Corpus dataset, which includes audio recordings and their transcriptions for healthy people and people with dementia. Several models were tested, including Convolutional Neural Networks (CNN) for audio classification, Transformers for text classification, and a combination of both in a multimodal ensemble. The models were evaluated on a test set, where the text modality gave the best results, reaching 90.36% accuracy on the task of detecting dementia. Additionally, an analysis of the corpus was conducted for the sake of explainability, aiming to learn more about how the models generate their predictions and to identify patterns in the data. © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Pages: 10
Related papers
50 in total
  • [21] Deep learning-based brain age prediction in normal aging and dementia
    Jeyeon Lee
    Brian J. Burkett
    Hoon-Ki Min
    Matthew L. Senjem
    Emily S. Lundt
    Hugo Botha
    Jonathan Graff-Radford
    Leland R. Barnard
    Jeffrey L. Gunter
    Christopher G. Schwarz
    Kejal Kantarci
    David S. Knopman
    Bradley F. Boeve
    Val J. Lowe
    Ronald C. Petersen
    Clifford R. Jack
    David T. Jones
    Nature Aging, 2022, 2 : 412 - 424
  • [22] Deep learning-based brain age prediction in normal aging and dementia
    Lee, Jeyeon
    Burkett, Brian J.
    Min, Hoon-Ki
    Senjem, Matthew L.
    Lundt, Emily S.
    Botha, Hugo
    Graff-Radford, Jonathan
    Barnard, Leland R.
    Gunter, Jeffrey L.
    Schwarz, Christopher G.
    Kantarci, Kejal
    Knopman, David S.
    Boeve, Bradley F.
    Lowe, Val J.
    Petersen, Ronald C.
    Jack, Clifford R., Jr.
    Jones, David T.
    NATURE AGING, 2022, 2 (05) : 412 - +
  • [23] A Multimodal Classification Architecture for the Severity Diagnosis of Glaucoma Based on Deep Learning
    Yi, Sanli
    Zhang, Gang
    Qian, Chaoxu
    Lu, YunQing
    Zhong, Hua
    He, Jianfeng
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [24] Deep Learning-Based Detection of Pigment Signs for Analysis and Diagnosis of Retinitis Pigmentosa
    Arsalan, Muhammad
    Baek, Na Rae
    Owais, Muhammad
    Mahmood, Tahir
    Park, Kang Ryoung
    SENSORS, 2020, 20 (12) : 1 - 20
  • [25] Deep Learning-based Approach to Predict Pulmonary Function at Chest CT
    Park, Hyunjung
    Yun, Jihye
    Lee, Sang Min
    Hwang, Hye Jeon
    Seo, Joon Beom
    Jung, Young Ju
    Hwang, Jeongeun
    Lee, Se Hee
    Lee, Sei Won
    Kim, Namkug
    RADIOLOGY, 2023, 307 (02)
  • [26] DeepAlloDriver: a deep learning-based strategy to predict cancer driver mutations
    Song, Qianqian
    Li, Mingyu
    Li, Qian
    Lu, Xun
    Song, Kun
    Zhang, Ziliang
    Wei, Jiale
    Zhang, Liang
    Wei, Jiacheng
    Ye, Youqiong
    Zha, Jinyin
    Zhang, Qiufen
    Gao, Qiang
    Long, Jiang
    Liu, Xinyi
    Lu, Xuefeng
    Zhang, Jian
    NUCLEIC ACIDS RESEARCH, 2023, 51 (W1) : W129 - W133
  • [27] A Survey of Deep Learning-Based Multimodal Emotion Recognition: Speech, Text, and Face
    Lian, Hailun
    Lu, Cheng
    Li, Sunan
    Zhao, Yan
    Tang, Chuangao
    Zong, Yuan
    ENTROPY, 2023, 25 (10)
  • [28] Time Awareness in Deep Learning-Based Multimodal Fusion Across Smartphone Platforms
    Sandha, Sandeep Singh
    Noor, Joseph
    Anwar, Fatima M.
    Srivastava, Mani
    2020 ACM/IEEE FIFTH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION (IOTDI 2020), 2020 : 149 - 156
  • [29] Towards Safer Roads: A Deep Learning-Based Multimodal Fatigue Monitoring System
    Hashemi, Maryam
    Farahani, Bahar
    Firouzi, Farshad
    2020 INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (IEEE COINS 2020), 2020 : 200 - 207
  • [30] Deep Learning-Based Multimodal 3 T MRI for the Diagnosis of Knee Osteoarthritis
    Hu, Yong
    Tang, Jie
    Zhao, Shenghao
    Li, Ye
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022