Privacy Preserving Acoustic Model Training for Speech Recognition

被引：0

作者：

Tachioka, Yuuki ^{[1
]}

机构：

[1] Denso IT Lab, Tokyo, Japan

来源：

2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2020年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In-domain speech data significantly improve the speech recognition performance of acoustic models. However, the data may contain confidential information and exposure of transcriptions may lead to a breach in speakers' privacy. In addition, speaker identification can be problematic when speakers want to hide their membership of a certain group. Thus, the in-domain data must be deleted after its period of use. However, once the data are deleted, models cannot be updated for future architectures. Privacy preservation is necessary when retaining speech data; it is important that the transcriptions cannot be reconstructed and the speaker cannot be identified. This paper proposes a privacy preserving acoustic model training (PPAMT) method that satisfies these requirements and formulates the sensitivities of three features (n-grams, phoneme labels, and acoustic features) for PPAMT. A sensitivity analysis showed that phoneme labels and acoustic features were less susceptible to PPAMT than n-grams, which is optimal because accurate phoneme labels and acoustic features are needed for acoustic model training. Speech recognition experiments showed that the word error rate degradation by PPAMT was less than 0.6% as a result of this property.

引用

页码：627 / 631

页数：5

共 50 条

[21] Acoustic model training using self-attention for low-resource speech recognition
Park, Hosung
Kim, Ji-Hwan
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (05): : 483 - 489
[22] Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition
Smidl, Lubos
Svec, Jan
Prazak, Ales
Trmal, Jan
SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 646 - 655
[23] Training Speech Recognition Model with Speech Synthesis and Text Discriminator
Lin, Hou-an
Chen, Chia-ping
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (02) : 359 - 373
[24] EXPLORING HASHING AND CRYPTONET BASED APPROACHES FOR PRIVACY-PRESERVING SPEECH EMOTION RECOGNITION
Dias, Miguel
Abad, Alberto
Trancoso, Isabel
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2057 - 2061
[25] Kirigami: Lightweight Speech Filtering for Privacy-Preserving Activity Recognition using Audio
Boovaraghavan, Sudershan
Zhou, Haozhe
Goel, Mayank
Agarwal, Yuvraj
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2024, 8 (01):
[26] ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Chennupati, Gopinath
Rao, Milind
Chadha, Gurpreet
Eakin, Aaron
Raju, Anirudh
Tiwari, Gautam
Sahu, Anit Kumar
Rastrow, Ariya
Droppo, Jasha
Oberlin, Andy
Nandanoor, Buddha
Venkataramanan, Prahalad
Wu, Zheng
Sitpure, Pankaj
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2780 - 2788
[27] PRIVACY ATTACKS FOR AUTOMATIC SPEECH RECOGNITION ACOUSTIC MODELS IN A FEDERATED LEARNING FRAMEWORK
Tomashenko, Natalia
Mdhaffar, Salima
Tommasi, Marc
Esteve, Yannick
Bonastre, Jean-Francois
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6972 - 6976
[28] X-vector anonymization using autoencoders and adversarial training for preserving speech privacy
Perero-Codosero, Juan M.
Espinoza-Cuadros, Fernando M.
Hernandez-Gomez, Luis A.
COMPUTER SPEECH AND LANGUAGE, 2022, 74
[29] Speech recognition based on unified model of acoustic and language aspects of speech
1600, Nippon Telegraph and Telephone Corp. (11):
[30] Privacy Preserving Face Recognition Utilizing Differential Privacy
Chamikara, M. A. P.
Bertok, P.
Khalil, I.
Liu, D.
Camtepe, S.
COMPUTERS & SECURITY, 2020, 97

← 1 2 3 4 5 →