Speech Signal Segmentation into Vocalized and Unvocalized Segments on the Basis of Simultaneous Masking

Cited by: 1
Authors
Konev, A. A. [1]
Meshcheryakov, R. V. [1]
Kostyuchenko, E. Yu. [1]
Affiliations
[1] Tomsk State Univ Control Syst & Radioelect, Pr Lenina 40, Tomsk 634050, Russia
Keywords
speech signal; simultaneous masking; speech signal segmentation; vocalized and unvocalized segments
DOI
10.3103/S8756699018040076
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
This paper considers a model of simultaneous acoustic masking that identifies the speech signal components perceived by the human auditory system. A simultaneous masking algorithm based on this model is proposed. It is shown that, after simultaneous masking, the signal is reduced to a binary structure that reflects the harmonic structure of a vocalized sequence. Experiments show that this structure can be used to detect the speech segments that are key from the standpoint of auditory perception. The structure serves as the basis for an algorithm that segments a speech signal into vocalized and unvocalized segments with high quality and requires no training before use. The joint use of the simultaneous masking and speech signal segmentation algorithms is tested, and their performance is evaluated.
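The abstract gives no formulas, so the following is only a rough sketch of the general idea, not the authors' algorithm. It assumes a triangular masking threshold that falls off at a fixed number of dB per frequency bin, an absolute audibility floor, and a placeholder voicing rule that checks whether the surviving components are evenly spaced. The names `magnitude_db`, `simultaneous_mask`, `is_vocalized`, and all parameter values are hypothetical.

```python
import cmath
import math


def magnitude_db(frame):
    """Hann-windowed DFT magnitudes in dB for bins 0..N/2 (naive DFT, short frames only)."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(frame)]
    levels = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(windowed))
        levels.append(20 * math.log10(abs(s) + 1e-12))  # small offset avoids log(0)
    return levels


def simultaneous_mask(levels_db, slope_db_per_bin=3.0, floor_db=0.0):
    """Return a binary pattern: 1 where a component exceeds the masking threshold.

    Assumption: every other component j masks bin k at a level of
    levels_db[j] - slope_db_per_bin * |j - k|. Real psychoacoustic models
    work on a critical-band scale with asymmetric spreading.
    """
    mask = []
    for k, level in enumerate(levels_db):
        threshold = max(
            (lj - slope_db_per_bin * abs(j - k)
             for j, lj in enumerate(levels_db) if j != k),
            default=floor_db,
        )
        mask.append(1 if level > floor_db and level > threshold else 0)
    return mask


def is_vocalized(mask, min_components=3, tolerance=1):
    """Placeholder rule: enough surviving components, spaced almost evenly."""
    bins = [k for k, bit in enumerate(mask) if bit]
    if len(bins) < min_components:
        return False
    gaps = [b - a for a, b in zip(bins, bins[1:])]
    return min(gaps) >= 2 and max(gaps) - min(gaps) <= tolerance
```

Under these assumptions, a 256-sample frame sampled at 8 kHz that contains five harmonics of 250 Hz keeps only bins 8, 16, 24, 32, and 40 after masking and is labeled vocalized, while an all-zero frame keeps no components. No training step is involved, which matches the abstract's claim in spirit only; the thresholds here are illustrative.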
Pages: 361-366
Number of pages: 6