DERIVING SPECTRO-TEMPORAL PROPERTIES OF HEARING FROM SPEECH DATA

被引:0
|
作者
Ondel, Lucas [1 ,3 ]
Li, Ruizhi [1 ]
Sell, Gregory [1 ,2 ]
Hermansky, Hynek [1 ,2 ,3 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD USA
[3] Brno Univ Technol, FIT, Ctr Excellence IT4I, Brno, Czech Republic
基金
美国国家科学基金会;
关键词
perception; spectro-temporal; auditory; deep learning; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Human hearing and human speech are intrinsically tied together, as the properties of speech almost certainly developed in order to be heard by human ears. As a result of this connection, it has been shown that certain properties of human hearing are mimicked within data-driven systems that are trained to understand human speech. In this paper, we further explore this phenomenon by measuring the spectro-temporal responses of data-derived filters in a front-end convolutional layer of a deep network trained to classify the phonemes of clean speech. The analyses show that the filters do indeed exhibit spectro-temporal responses similar to those measured in mammals, and also that the filters exhibit an additional level of frequency selectivity, similar to the processing pipeline assumed within the Articulation Index.
引用
收藏
页码:411 / 415
页数:5
相关论文
共 50 条
  • [1] Aging and Spectro-Temporal Integration of Speech
    Grose, John H.
    Porter, Heather L.
    Buss, Emily
    TRENDS IN HEARING, 2016, 20
  • [2] Development of spectro-temporal features of speech in children
    Gautam S.
    Singh L.
    Gautam, Sumanlata (suman.gautam82@gmail.com), 1600, Springer Science and Business Media, LLC (20): : 543 - 551
  • [3] SPECTRO-TEMPORAL NEURAL FACTORIZATION FOR SPEECH DEREVERBERATION
    Chien, Jen-Tzung
    Kuo, Kuan-Ting
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5449 - 5453
  • [4] A spectro-temporal modulation test for predicting speech reception in hearing-impaired listeners with hearing aids
    Zaar, Johannes
    Simonsen, Lisbeth Birkelund
    Laugesen, Soren
    HEARING RESEARCH, 2024, 443
  • [5] Localized spectro-temporal cepstral analysis of speech
    Bouvrie, Jake
    Ezzat, Tony
    Poggio, Tomaso
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4733 - 4736
  • [6] Speaker sex effects on temporal and spectro-temporal measures of speech
    Herrmann, Frank
    Cunningham, Stuart P.
    Whiteside, Sandra P.
    JOURNAL OF THE INTERNATIONAL PHONETIC ASSOCIATION, 2014, 44 (01) : 59 - 74
  • [7] Spectro-Temporal Sparsity Characterization for Dysarthric Speech Detection
    Kodrasi, Ina
    Bourlard, Herve
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1210 - 1222
  • [8] Spectro-Temporal Representation of Speech for Intelligibility Assessment of Dysarthria
    Chandrashekar, H. M.
    Karjigi, Veena
    Sreedevi, N.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (02) : 390 - 399
  • [9] Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    Mesgarani, N
    Slaney, M
    Shamma, SA
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 920 - 930
  • [10] Data-Driven and Feedback Based Spectro-Temporal Features for Speech Recognition
    Sivaram, G. S. V. S.
    Nemala, Sridhar Krishna
    Mesgarani, Nima
    Hermansky, Hynek
    IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (11) : 957 - 960