Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

被引:4
|
作者
Mang, Loredana Daria [1 ]
Martinez, Francisco David Gonzalez [1 ]
Munoz, Damian Martinez [1 ]
Galan, Sebastian Garcia [1 ]
Cortina, Raquel [2 ]
机构
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
关键词
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;
D O I
10.3390/s24020682
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system's condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input-classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Cochleogram-based adventitious sounds classification using convolutional neural networks
    Mang, L. D.
    Canadas-Quesada, F. J.
    Carabias-Orti, J. J.
    Combarro, E. F.
    Ranilla, J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 82
  • [2] Classification of adventitious sounds
    Walmsley, WCD
    BRITISH MEDICAL JOURNAL, 1938, 1 : 702 - 702
  • [3] Classification of adventitious sounds
    Nixon, JA
    BRITISH MEDICAL JOURNAL, 1938, 1938 : 870 - 870
  • [4] Classification of adventitious sounds
    Maxwell, J
    BRITISH MEDICAL JOURNAL, 1938, 1938 : 917 - 917
  • [5] Classification of adventitious sounds
    Wynn, WH
    BRITISH MEDICAL JOURNAL, 1938, 1938 : 973 - 973
  • [6] Classification of adventitious sounds
    Hutchison, R
    BRITISH MEDICAL JOURNAL, 1938, 1938 : 752 - 752
  • [7] VisFormers-Combining Vision and Transformers for Enhanced Complex Document Classification
    Dutta, Subhayu
    Adhikary, Subhrangshu
    Dwivedi, Ashutosh Dhar
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (01): : 448 - 463
  • [8] Methodology for Automatic Classification of Adventitious Lung Sounds
    Riella, R. J.
    Nohama, P.
    Maia, J. M.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 4: IMAGE PROCESSING, BIOSIGNAL PROCESSING, MODELLING AND SIMULATION, BIOMECHANICS, 2010, 25 : 1392 - 1395
  • [9] Vision Transformers for Brain Tumor Classification
    Simon, Eliott
    Briassouli, Alexia
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOIMAGING), VOL 2, 2021, : 123 - 130
  • [10] Automatic Classification of Adventitious Respiratory Sounds: A (Un)Solved Problem?
    Rocha, Bruno Machado
    Pessoa, Diogo
    Marques, Alda
    Carvalho, Paulo
    Paiva, Rui Pedro
    SENSORS, 2021, 21 (01) : 1 - 19