Privacy Preserving Acoustic Model Training for Speech Recognition

被引:0
|
作者
Tachioka, Yuuki [1 ]
机构
[1] Denso IT Lab, Tokyo, Japan
来源
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2020年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In-domain speech data significantly improve the speech recognition performance of acoustic models. However, the data may contain confidential information and exposure of transcriptions may lead to a breach in speakers' privacy. In addition, speaker identification can be problematic when speakers want to hide their membership of a certain group. Thus, the in-domain data must be deleted after its period of use. However, once the data are deleted, models cannot be updated for future architectures. Privacy preservation is necessary when retaining speech data; it is important that the transcriptions cannot be reconstructed and the speaker cannot be identified. This paper proposes a privacy preserving acoustic model training (PPAMT) method that satisfies these requirements and formulates the sensitivities of three features (n-grams, phoneme labels, and acoustic features) for PPAMT. A sensitivity analysis showed that phoneme labels and acoustic features were less susceptible to PPAMT than n-grams, which is optimal because accurate phoneme labels and acoustic features are needed for acoustic model training. Speech recognition experiments showed that the word error rate degradation by PPAMT was less than 0.6% as a result of this property.
引用
收藏
页码:627 / 631
页数:5
相关论文
共 50 条
  • [21] Acoustic model training using self-attention for low-resource speech recognition
    Park, Hosung
    Kim, Ji-Hwan
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (05): : 483 - 489
  • [22] Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition
    Smidl, Lubos
    Svec, Jan
    Prazak, Ales
    Trmal, Jan
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 646 - 655
  • [23] Training Speech Recognition Model with Speech Synthesis and Text Discriminator
    Lin, Hou-an
    Chen, Chia-ping
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (02) : 359 - 373
  • [24] EXPLORING HASHING AND CRYPTONET BASED APPROACHES FOR PRIVACY-PRESERVING SPEECH EMOTION RECOGNITION
    Dias, Miguel
    Abad, Alberto
    Trancoso, Isabel
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2057 - 2061
  • [25] Kirigami: Lightweight Speech Filtering for Privacy-Preserving Activity Recognition using Audio
    Boovaraghavan, Sudershan
    Zhou, Haozhe
    Goel, Mayank
    Agarwal, Yuvraj
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2024, 8 (01):
  • [26] ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
    Chennupati, Gopinath
    Rao, Milind
    Chadha, Gurpreet
    Eakin, Aaron
    Raju, Anirudh
    Tiwari, Gautam
    Sahu, Anit Kumar
    Rastrow, Ariya
    Droppo, Jasha
    Oberlin, Andy
    Nandanoor, Buddha
    Venkataramanan, Prahalad
    Wu, Zheng
    Sitpure, Pankaj
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2780 - 2788
  • [27] PRIVACY ATTACKS FOR AUTOMATIC SPEECH RECOGNITION ACOUSTIC MODELS IN A FEDERATED LEARNING FRAMEWORK
    Tomashenko, Natalia
    Mdhaffar, Salima
    Tommasi, Marc
    Esteve, Yannick
    Bonastre, Jean-Francois
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6972 - 6976
  • [28] X-vector anonymization using autoencoders and adversarial training for preserving speech privacy
    Perero-Codosero, Juan M.
    Espinoza-Cuadros, Fernando M.
    Hernandez-Gomez, Luis A.
    COMPUTER SPEECH AND LANGUAGE, 2022, 74
  • [29] Speech recognition based on unified model of acoustic and language aspects of speech
    1600, Nippon Telegraph and Telephone Corp. (11):
  • [30] Privacy Preserving Face Recognition Utilizing Differential Privacy
    Chamikara, M. A. P.
    Bertok, P.
    Khalil, I.
    Liu, D.
    Camtepe, S.
    COMPUTERS & SECURITY, 2020, 97