Vision Transformer for Parkinson's Disease Classification using Multilingual Sustained Vowel Recordings

被引:3
|
作者
Hemmerling, Daria [1 ]
Wodzinski, Marek [1 ,2 ]
Orozco-Arroyave, Juan Rafael [3 ,4 ]
Sztaho, David [5 ]
Daniol, Mateusz [1 ]
Jemiolo, Pawel [1 ]
Wojcik-Pedziwiatr, Magdalena [6 ]
机构
[1] AGH Univ Sci & Technol, Fac Elect Engn Automat Comp Sci & Biomed Engn, Krakow, Poland
[2] Univ Appl Sci Western Switzerland, Inst Informat Syst, HES SO Valais, Sierre, Switzerland
[3] Univ Antioquia, Medellin, Colombia
[4] Univ Erlangen Nurnberg, Pattern Recognit Lab, Erlangen, Germany
[5] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
[6] Krakow Univ, Dept Neurol, Krakow, Poland
关键词
Deep Learning; Vision Transformer; Voice Processing; Neurodegenerative Diseases; Hypokinetic Dysarthria;
D O I
10.1109/EMBC40787.2023.10340478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parkinson's disease (PD) is the 2(nd) most prevalent neurodegenerative disease in the world. Thus, the early detection of PD has recently been the subject of several scientific and commercial studies. In this paper, we propose a pipeline using Vision Transformer applied to mel-spectrograms for PD classification using multilingual sustained vowel recordings. Furthermore, our proposed transformed-based model shows a great potential to use voice as a single modality biomarker for automatic PD detection without language restrictions, a wide range of vowels, with an F1-score equal to 0.78. The results of our study fall within the range of the estimated prevalence of voice and speech disorders in Parkinson's disease, which ranges from 70-90%. Our study demonstrates a high potential for adaptation in clinical decision-making, allowing for increasingly systematic and fast diagnosis of PD with the potential for use in telemedicine.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] A deep learning based approach for automated plant disease classification using vision transformer
    Yasamin Borhani
    Javad Khoramdel
    Esmaeil Najafi
    Scientific Reports, 12
  • [32] A deep learning based approach for automated plant disease classification using vision transformer
    Borhani, Yasamin
    Khoramdel, Javad
    Najafi, Esmaeil
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [33] Machine learning-based classification of Parkinson's disease using acoustic features: Insights from multilingual speech tasks
    Department of AI & Informatics, Graduate School, Sangmyung University, Hongjimun 2-Gil 20, Jongno-gu, Seoul
    03016, Korea, Republic of
    不详
    07061, Korea, Republic of
    不详
    03016, Korea, Republic of
    不详
    03080, Korea, Republic of
    Comput. Biol. Med.,
  • [34] Evaluation of Environmental Sound Classification using Vision Transformer
    Wang, Changlong
    Ito, Akinori
    Nose, Takashi
    Chen, Chia-Ping
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 665 - 669
  • [35] An effective human monkeypox classification using vision transformer
    Aloraini, Mohammed
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)
  • [36] Image Classification Using Vision Transformer for EtC Images
    Hamano, Genki
    Imaizumi, Shoko
    Kiya, Hitoshi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1506 - 1513
  • [37] A vision transformer for emphysema classification using CT images
    Wu, Yanan
    Qi, Shouliang
    Sun, Yu
    Xia, Shuyue
    Yao, Yudong
    Qian, Wei
    PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (24):
  • [38] Double vision in Parkinson's disease
    Nebe, A
    Georg, E
    MOVEMENT DISORDERS, 2004, 19 : S138 - S138
  • [39] Image Quality Distortion Classification Using Vision Transformer
    Lynn, Nay Chi
    Shimamura, Tetsuya
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, AINA 2024, 2024, 199 : 353 - 361
  • [40] Transformer-based transfer learning on self-reported voice recordings for Parkinson's disease diagnosis
    Tougui, Ilias
    Zakroum, Mehdi
    Karrakchou, Ouassim
    Ghogho, Mounir
    SCIENTIFIC REPORTS, 2024, 14 (01):