Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

Cited by: 4
Authors
Mang, Loredana Daria [1]
Martinez, Francisco David Gonzalez [1]
Munoz, Damian Martinez [1]
Galan, Sebastian Garcia [1]
Cortina, Raquel [2]
Affiliations
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
Keywords
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;
DOI
10.3390/s24020682
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302; 081704;
Abstract
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the condition of the respiratory system and identifying abnormalities. The main contribution of this study is to investigate classification performance when the input data, represented as a cochleogram, is fed to the Vision Transformer (ViT) architecture; to our knowledge, this is the first time this input-classifier combination has been applied to adventitious sound classification. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by using the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with that of other state-of-the-art CNN approaches using the spectrogram, Mel frequency cepstral coefficients, the constant-Q transform, and the cochleogram as input data. Our results confirm the superior classification performance of combining the cochleogram with ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to ongoing efforts to develop automatic intelligent techniques aimed at significantly increasing the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
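The abstract mentions that ViT applies self-attention to patches of a time-frequency image. As a rough illustration of that tokenization step only, the sketch below splits a 2D array into flattened, non-overlapping patches with NumPy. The function name `patchify`, the patch size of 16, the 64 x 200 input shape, and the random placeholder standing in for a cochleogram are all assumptions made for this example; none of them are taken from the paper, and no gammatone filtering or model is implemented here.

```python
import numpy as np


def patchify(tf_image: np.ndarray, patch: int = 16) -> np.ndarray:
    """Split a (freq, time) array into flattened non-overlapping patches.

    Hypothetical helper for illustration; patch size 16 is an assumption.
    """
    n_freq, n_time = tf_image.shape
    # Number of whole patches along each axis; any remainder is cropped.
    n_f, n_t = n_freq // patch, n_time // patch
    cropped = tf_image[: n_f * patch, : n_t * patch]
    # (n_f, patch, n_t, patch) -> (n_f, n_t, patch, patch)
    blocks = cropped.reshape(n_f, patch, n_t, patch).transpose(0, 2, 1, 3)
    # One row per patch, ordered frequency-band first, then time.
    return blocks.reshape(n_f * n_t, patch * patch)


# Random placeholder standing in for a cochleogram (64 bands x 200 frames).
rng = np.random.default_rng(0)
cochleogram = rng.random((64, 200))
tokens = patchify(cochleogram)
print(tokens.shape)  # (48, 256): 4 x 12 patches, each 16 x 16 values
```

In a ViT-style model, each row of `tokens` would then be linearly projected and combined with a positional embedding before the self-attention layers; that part is omitted from this sketch.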
Pages: 23
Related papers
50 in total
  • [21] Vision Transformers Applied to Indoor Room Classification
    Veiga, Bruno
    Pinto, Tiago
    Teixeira, Ruben
    Ramos, Carlos
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 561 - 573
  • [22] Classification between normal and adventitious lung sounds using deep neural network
    Li, Lin
    Xu, Wenhao
    Hong, Qingyang
    Tong, Feng
    Wu, Jinzhun
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [23] ACOUSTICAL ANALYSIS OF ADVENTITIOUS LUNG SOUNDS AS AN APPROACH TO AN AUTOMATIC CLASSIFICATION-SYSTEM
    DALMASSO, F
    BENEDETTO, G
    SPAGNOLO, R
    BULLETIN EUROPEEN DE PHYSIOPATHOLOGIE RESPIRATOIRE-CLINICAL RESPIRATORY PHYSIOLOGY, 1986, 22 : S140 - S140
  • [24] CONTINUOUS ADVENTITIOUS LUNG SOUNDS
    KOSTER, MEY
    BAUGHMAN, RP
    LOUDON, RG
    JOURNAL OF ASTHMA, 1990, 27 (04) : 237 - 249
  • [25] Localization of adventitious respiratory sounds
    Henry, Brian
    Royston, Thomas J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (03) : 1297 - 1307
  • [26] DISCONTINUOUS ADVENTITIOUS LUNG SOUNDS
    MURPHY, RLH
    SEMINARS IN RESPIRATORY MEDICINE, 1985, 6 (03) : 210 - 219
  • [27] CellViT: Vision Transformers for precise cell segmentation and classification
    Hoerst, Fabian
    Rempe, Moritz
    Heine, Lukas
    Seibold, Constantin
    Keyl, Julius
    Baldini, Giulia
    Ugurel, Selma
    Siveke, Jens
    Gruenwald, Barbara
    Egger, Jan
    Kleesiek, Jens
    MEDICAL IMAGE ANALYSIS, 2024, 94
  • [28] Vision Transformers for Breast Cancer Histology Image Classification
    Baroni, Giulia L.
    Rasotto, Laura
    Roitero, Kevin
    Siraj, Ameer Hamza
    Della Mea, Vincenzo
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 15 - 26
  • [29] The classification of the bladder cancer based on Vision Transformers (ViT)
    Ola S. Khedr
    Mohamed E. Wahed
    Al-Sayed R. Al-Attar
    E. A. Abdel-Rehim
    Scientific Reports, 13
  • [30] Quantum Vision Transformers for Quark-Gluon Classification
    Comajoan Cara, Marcal
    Dahale, Gopal Ramesh
    Dong, Zhongtian
    Forestano, Roy T.
    Gleyzer, Sergei
    Justice, Daniel
    Kong, Kyoungchul
    Magorsch, Tom
    Matchev, Konstantin T.
    Matcheva, Katia
    Unlu, Eyup B.
    AXIOMS, 2024, 13 (05)