Speech Signal Segmentation into Vocalized and Unvocalized Segments on the Basis of Simultaneous Masking

Cited by: 1

Authors:

Konev, A. A. [1]

Meshcheryakov, R. V. [1]

Kostyuchenko, E. Yu. [1]

Affiliation:

[1] Tomsk State Univ Control Syst & Radioelect, Pr Lenina 40, Tomsk 634050, Russia

Source:

OPTOELECTRONICS INSTRUMENTATION AND DATA PROCESSING | 2018 / Vol. 54 / No. 04

Keywords:

speech signal; simultaneous masking; speech signal segmentation; vocalized and unvocalized segments

DOI:

10.3103/S8756699018040076

Chinese Library Classification (CLC) number:

O4 [Physics]

Discipline classification code:

0702

Abstract:

This paper considers a model of simultaneous acoustic masking that identifies the speech signal components perceived by the human auditory system. A simultaneous masking algorithm based on this model is proposed. It is shown that, after simultaneous masking, a signal becomes a binary structure that reflects the harmonic structure of a vocalized sequence. Experiments show that this structure can be used to detect key speech segments (from the standpoint of perception by the auditory system). The structure serves as the basis for an algorithm that segments a speech signal into vocalized and unvocalized segments with high quality and requires no training before use. The joint use of the simultaneous masking and segmentation algorithms is tested, and their performance is evaluated.


Pages: 361-366

Number of pages: 6
