Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

被引:4
|
作者
Mang, Loredana Daria [1 ]
Martinez, Francisco David Gonzalez [1 ]
Munoz, Damian Martinez [1 ]
Galan, Sebastian Garcia [1 ]
Cortina, Raquel [2 ]
机构
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
关键词
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;
D O I
10.3390/s24020682
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system's condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input-classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] The classification of the bladder cancer based on Vision Transformers (ViT)
    Khedr, Ola S.
    Wahed, Mohamed E.
    Al-Attar, Al-Sayed R.
    Abdel-Rehim, E. A.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [32] Optimizing Mobile Vision Transformers for Land Cover Classification
    Rozario, Papia F.
    Gadgil, Ravi
    Lee, Junsu
    Gomes, Rahul
    Keller, Paige
    Liu, Yiheng
    Sipos, Gabriel
    Mcdonnell, Grace
    Impola, Westin
    Rudolph, Joseph
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [33] Image forgery classification and localization through vision transformers
    Pawar, Digambar
    Gowda, Raghavendra
    Chandra, Krishna
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (01)
  • [34] Vision Transformers Based Classification for Glaucomatous Eye Condition
    Wassel, Moustafa
    Hamdi, Ahmed M.
    Adly, Noha
    Torki, Marwan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 5082 - 5088
  • [35] Exploring Vision Transformers for Polarimetric SAR Image Classification
    Dong, Hongwei
    Zhang, Lamei
    Zou, Bin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [36] Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy
    Rozendo, Guilherme Botazzo
    Roberto, Guilherme Freire
    Zanchetta do Nascimento, Marcelo
    Neves, Leandro Alves
    Lumini, Alessandra
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 229 - 243
  • [37] Classification and analysis of non-stationary characteristics of crackle and rhonchus lung adventitious sounds
    Icer, Semra
    Gengec, Serife
    DIGITAL SIGNAL PROCESSING, 2014, 28 : 18 - 27
  • [38] The occurrence of adventitious sounds in the normal chest
    Rudolf, RD
    LANCET, 1910, 1 : 1098 - 1098
  • [39] Enhancing furcation involvement classification on panoramic radiographs with vision transformers
    Zhang, Xuan
    Guo, Enting
    Liu, Xu
    Zhao, Hong
    Yang, Jie
    Li, Wen
    Wu, Wenlei
    Sun, Weibin
    BMC ORAL HEALTH, 2025, 25 (01):
  • [40] Harnessing the power of vision transformers for enhanced OCT image classification
    Paraschiv, Elena-Anca
    Sultana, Alina-Elena
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2024, 34 (02):