DERIVING SPECTRO-TEMPORAL PROPERTIES OF HEARING FROM SPEECH DATA

被引:0
|
作者
Ondel, Lucas [1 ,3 ]
Li, Ruizhi [1 ]
Sell, Gregory [1 ,2 ]
Hermansky, Hynek [1 ,2 ,3 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD USA
[3] Brno Univ Technol, FIT, Ctr Excellence IT4I, Brno, Czech Republic
基金
美国国家科学基金会;
关键词
perception; spectro-temporal; auditory; deep learning; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Human hearing and human speech are intrinsically tied together, as the properties of speech almost certainly developed in order to be heard by human ears. As a result of this connection, it has been shown that certain properties of human hearing are mimicked within data-driven systems that are trained to understand human speech. In this paper, we further explore this phenomenon by measuring the spectro-temporal responses of data-derived filters in a front-end convolutional layer of a deep network trained to classify the phonemes of clean speech. The analyses show that the filters do indeed exhibit spectro-temporal responses similar to those measured in mammals, and also that the filters exhibit an additional level of frequency selectivity, similar to the processing pipeline assumed within the Articulation Index.
引用
收藏
页码:411 / 415
页数:5
相关论文
共 50 条
  • [41] On the Suitability of the Riesz Spectro-Temporal Envelope for WaveNet Based Speech Synthesis
    Dhiman, Jitendra Kumar
    Adiga, Nagaraj
    Seelamantula, Chandra Sekhar
    INTERSPEECH 2019, 2019, : 944 - 948
  • [42] Spectro-Temporal Weighting of Loudness
    Oberfeld, Daniel
    Heeren, Wiebke
    Rennies, Jan
    Verhey, Jesko
    PLOS ONE, 2012, 7 (11):
  • [43] Spectro-temporal Encoding of Speech Responses in Glioma-Infiltrated Cortex
    Aabedi, Alexander
    Lipkin, Benjamin
    Young, Jacob
    Krishna, Saritha
    Kakaizada, Sofia
    Kaur, Jasleen
    Berger, Mitchel
    Brang, David
    Hervey-Jumper, Shawn
    JOURNAL OF NEUROSURGERY, 2021, 135 (02) : 15 - 15
  • [44] Effects of hearing level and spectro-temporal pattern in the identification of environmental sounds in children with hearing impairment
    Tabaru, Kei
    Kobayashi, Yuko
    Harashima, Tsuneo
    Katada, Akiyoshi
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2016, 51 : 210 - 210
  • [45] Cognitive Abilities Contribute to Spectro-Temporal Discrimination in Children Who Are Hard of Hearing
    Kirby, Benjamin J.
    Spratford, Meredith
    Klein, Kelsey E.
    McCreery, Ryan W.
    EAR AND HEARING, 2019, 40 (03): : 645 - 650
  • [46] Neural responses to speech-specific modulations derived from a spectro-temporal filter bank
    Frye, Marina
    Micheli, Cristiano
    Schepers, Inga M.
    Schalk, Gerwin
    Rieger, Jochem W.
    Meyer, Bernd T.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1368 - 1372
  • [47] Spectro-Temporal Characteristics of Speech at High Frequencies, and the Potential for Restoration of Audibility to People with Mild-to-Moderate Hearing Loss
    Moore, Brian C. J.
    Stone, Michael A.
    Fullgrabe, Christian
    Glasberg, Brian R.
    Puria, Sunil
    EAR AND HEARING, 2008, 29 (06): : 907 - 922
  • [48] SPECTRO-TEMPORAL ANALYSIS IN NORMAL-HEARING AND COCHLEAR-IMPAIRED LISTENERS
    HALL, JW
    DAVIS, AC
    HAGGARD, MP
    PILLSBURY, HC
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (04): : 1325 - 1331
  • [49] SPECTRO-TEMPORAL CHARACTERISTICS FROM CHAINS OF TYPE I
    da Luz Sodre, Zuleika Auxiliadora
    Rocha Fernandes, Francisco Carlos
    REVISTA UNIVAP, 2013, 19 (34) : 76 - 81
  • [50] Versatile Parametric Spectro-Temporal Analyzer
    Zhang, Chi
    Wong, Kenneth K. Y.
    2014 IEEE PHOTONICS SOCIETY SUMMER TOPICAL MEETING SERIES, 2014, : 132 - 133