Simultaneous speaker identification and watermarking

Cited by: 2
Authors
Abd El-Wahab, Basant S. [1]
El-khobby, Heba A. [1]
Abd Elnaby, Mustafa M. [1]
Abd El-Samie, Fathi E. [2]
Affiliations
[1] Tanta Univ, Fac Engn, Dept Elect & Elect Commun Engn, Tanta, Egypt
[2] Menoufia Univ, Fac Elect Engn, Dept Elect & Elect Commun, Al Minufiyah, Egypt
Keywords
Biometric systems; Speech watermarking; Empirical mode decomposition; Mel frequency cepstral coefficients; Speech enhancement; Speaker identification
DOI
10.1007/s10772-019-09658-x
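Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract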
Biometric template protection of speech signals and information hiding in speech signals are two challenging issues. To address them and increase the level of security, our objective is to build multi-level security systems based on speech signals. Speech watermarking is therefore used simultaneously with automatic speaker identification. The watermarking embeds images into the speech signals that are used for speaker identification. The watermark is extracted for authentication, and the effect of watermark removal on the performance of the speaker identification system in the presence of degradations is then studied. This paper presents an approach to speech watermarking based on empirical mode decomposition (EMD) in different transform domains and singular value decomposition (SVD). The speech signal is decomposed with EMD in different transform domains to yield zero-mean components called intrinsic mode functions (IMFs). The watermark is inserted into one of these IMFs with SVD. Different transform domains are compared for implementing the proposed watermarking scheme on different IMFs, using the log-likelihood ratio (LLR), correlation coefficient (C_r), signal-to-noise ratio (SNR), and spectral distortion (SD) as metrics. According to the simulation results, watermark embedding in the discrete sine transform domain provides higher SNR and C_r values and lower SD and LLR values than the other domains considered. The proposed approach is reported to be robust to different attacks.
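Two of the comparison metrics named in the abstract, SNR and the correlation coefficient, can be illustrated with a minimal sketch. It assumes the common textbook definitions (SNR as the ratio of signal energy to error energy in dB, and the Pearson correlation coefficient); the paper's exact formulas are not given in this record and may differ. The function names `snr_db` and `correlation_coefficient` are hypothetical, chosen for illustration only.

```python
import math


def snr_db(original, processed):
    """SNR in dB between a reference signal and its processed version.

    Assumed definition: 10 * log10(sum(x^2) / sum((x - y)^2)).
    """
    if len(original) != len(processed):
        raise ValueError("signals must have the same length")
    signal_energy = sum(x * x for x in original)
    noise_energy = sum((x - y) ** 2 for x, y in zip(original, processed))
    if signal_energy == 0:
        raise ValueError("reference signal has zero energy")
    if noise_energy == 0:
        return math.inf  # identical signals: no distortion at all
    return 10.0 * math.log10(signal_energy / noise_energy)


def correlation_coefficient(a, b):
    """Pearson correlation between two equal-length sequences.

    Assumed here as the similarity measure between an original and an
    extracted watermark, both flattened to 1-D sequences.
    """
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_a * var_b)
```

Under these definitions, a higher SNR means the watermarked speech is closer to the original, and a correlation coefficient near 1 means the extracted watermark closely matches the embedded one.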
Pages: 205-218
Number of pages: 14