Speech Signal Segmentation into Vocalized and Unvocalized Segments on the Basis of Simultaneous Masking

被引：1

作者：

Konev, A. A. ^{[1
]}

Meshcheryakov, R. V. ^{[1
]}

Kostyuchenko, E. Yu ^{[1
]}

机构：

[1] Tomsk State Univ Control Syst & Radioelect, Pr Lenina 40, Tomsk 634050, Russia

来源：

OPTOELECTRONICS INSTRUMENTATION AND DATA PROCESSING | 2018年 / 54卷 / 04期

关键词：

speech signal; simultaneous masking; speech signal segmentation; vocalized and unvocalized segments;

D O I：

10.3103/S8756699018040076

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

This paper touches upon a model of simultaneous acoustic masking, which detects speech signal components perceived by a human's auditory system. A simultaneous masking algorithm on the basis of this model is proposed. It is shown that, after simultaneous masking, a signal becomes a binary structure that reflects the harmonic structure of a vocalized sequence. It is experimentally proven that this structure can be used to detect key speech segments (from the standpoint of perception by an auditory system). This structure serves as a basis for an algorithm of high-quality segmentation of a speech signal into vocalized and unvocalized segments, which does not require learning before use. The joint use of the algorithms for simultaneous masking and speech signal segmentation is tested, and their performance is evaluated.

引用

页码：361 / 366

页数：6

共 50 条

[1] Speech Signal Segmentation into Silence, Unvoiced and Vocalized Sections in Speech Rehabilitation
Novokhrestova, Dariya
Kostyuchenko, Evgeny
Krivoshein, Ilya
Balatskaya, Lidiya
SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 601 - 610
[2] CHANGES IN DURATION OF VOCALIZED AND SILENT SPEECH SEGMENTS WITH SPEED OF LOUD READING
MORAVEK, M
BURES, P
KREKULE, I
PHYSIOLOGIA BOHEMOSLOVACA, 1976, 25 (05): : 460 - 460
[3] Simultaneous relative cue reliance in speech-on-speech masking
Lutfi, R. A.
Zandona, M.
Lee, J.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (04): : 2530 - 2538
[4] Photometric Segmentation: Simultaneous Photometric Stereo and Masking
Haefner, Bjoern
Queau, Yvain
Cremers, Daniel
2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 222 - 229
[5] Phoneme Segmentation of Speech Signal
Goh, Y. H.
Raveendran, P.
2009 INTERNATIONAL CONFERENCE FOR TECHNICAL POSTGRADUATES (TECHPOS 2009), 2009, : 150 - 152
[6] SPEECH MASKING .1. SIMULTANEOUS AND NONSIMULTANEOUS MASKING WITHIN STOP /D
SPIEGEL, MF
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (05): : 1492 - 1502
[7] RESTORATION OF MISSING VOICED SPEECH SIGNAL SEGMENTS
Paulikas, Sarunas
ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 520 - 523
[8] Multimodal speaker segmentation in presence of overlapped speech segments
Rozgic, Viktor
Han, Kyu Jeong
Georgiou, Panayiotis G.
Narayanan, Shrikanth
ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 679 - 684
[9] Masking speech with its time-reversed signal
Arai, Takayuki
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2010, 31 (02) : 188 - 190
[10] Informational masking in young and elderly listeners for speech masked by simultaneous speech and noise
Agus, Trevor R.
Akeroyd, Michael A.
Gatehouse, Stuart
Warden, David
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (04): : 1926 - 1940

← 1 2 3 4 5 →