Speech Signal Segmentation into Vocalized and Unvocalized Segments on the Basis of Simultaneous Masking

被引:1
|
作者
Konev, A. A. [1 ]
Meshcheryakov, R. V. [1 ]
Kostyuchenko, E. Yu [1 ]
机构
[1] Tomsk State Univ Control Syst & Radioelect, Pr Lenina 40, Tomsk 634050, Russia
关键词
speech signal; simultaneous masking; speech signal segmentation; vocalized and unvocalized segments;
D O I
10.3103/S8756699018040076
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
This paper touches upon a model of simultaneous acoustic masking, which detects speech signal components perceived by a human's auditory system. A simultaneous masking algorithm on the basis of this model is proposed. It is shown that, after simultaneous masking, a signal becomes a binary structure that reflects the harmonic structure of a vocalized sequence. It is experimentally proven that this structure can be used to detect key speech segments (from the standpoint of perception by an auditory system). This structure serves as a basis for an algorithm of high-quality segmentation of a speech signal into vocalized and unvocalized segments, which does not require learning before use. The joint use of the algorithms for simultaneous masking and speech signal segmentation is tested, and their performance is evaluated.
引用
收藏
页码:361 / 366
页数:6
相关论文
共 50 条
  • [1] Speech Signal Segmentation into Silence, Unvoiced and Vocalized Sections in Speech Rehabilitation
    Novokhrestova, Dariya
    Kostyuchenko, Evgeny
    Krivoshein, Ilya
    Balatskaya, Lidiya
    SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 601 - 610
  • [2] CHANGES IN DURATION OF VOCALIZED AND SILENT SPEECH SEGMENTS WITH SPEED OF LOUD READING
    MORAVEK, M
    BURES, P
    KREKULE, I
    PHYSIOLOGIA BOHEMOSLOVACA, 1976, 25 (05): : 460 - 460
  • [3] Simultaneous relative cue reliance in speech-on-speech masking
    Lutfi, R. A.
    Zandona, M.
    Lee, J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (04): : 2530 - 2538
  • [4] Photometric Segmentation: Simultaneous Photometric Stereo and Masking
    Haefner, Bjoern
    Queau, Yvain
    Cremers, Daniel
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 222 - 229
  • [5] Phoneme Segmentation of Speech Signal
    Goh, Y. H.
    Raveendran, P.
    2009 INTERNATIONAL CONFERENCE FOR TECHNICAL POSTGRADUATES (TECHPOS 2009), 2009, : 150 - 152
  • [6] SPEECH MASKING .1. SIMULTANEOUS AND NONSIMULTANEOUS MASKING WITHIN STOP /D
    SPIEGEL, MF
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (05): : 1492 - 1502
  • [7] RESTORATION OF MISSING VOICED SPEECH SIGNAL SEGMENTS
    Paulikas, Sarunas
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 520 - 523
  • [8] Multimodal speaker segmentation in presence of overlapped speech segments
    Rozgic, Viktor
    Han, Kyu Jeong
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 679 - 684
  • [9] Masking speech with its time-reversed signal
    Arai, Takayuki
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2010, 31 (02) : 188 - 190
  • [10] Informational masking in young and elderly listeners for speech masked by simultaneous speech and noise
    Agus, Trevor R.
    Akeroyd, Michael A.
    Gatehouse, Stuart
    Warden, David
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (04): : 1926 - 1940