Prεεch: A System for Privacy-Preserving Speech Transcription

被引:0
|
作者
Ahmed, Shimaa [1 ]
Chowdhury, Amrita Roy [1 ]
Fawaz, Kassem [1 ]
Ramanathan, Parmesh [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
SPEAKER;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
New advances in machine learning have made Automated Speech Recognition (AS R) systems practical and more scalable. These systems, however, pose serious privacy threats as speech is a rich source of sensitive acoustic and textual information. Although offline and open-source ASR eliminates the privacy risks, its transcription performance is inferior to that of cloud-based ASR systems, especially for real-world use cases. In this paper, we propose Pr epsilon epsilon ch, an end-to-end speech transcription system which lies at an intermediate point in the privacy-utility spectrum. It protects the acoustic features of the speakers' voices and protects the privacy of the textual content at an improved performance relative to offline ASR. Additionally, Pr epsilon epsilon ch provides several control knobs to allow customizable utility-usability-privacy trade-off. It relies on cloud-based services to transcribe a speech file after applying a series of privacy-preserving operations on the user's side. We perform a comprehensive evaluation of Pr epsilon epsilon ch, using diverse real-world datasets, that demonstrates its effectiveness. Pr epsilon epsilon ch provides transcription at a 2% to 32.25% (mean 17.34%) relative improvement in word error rate over Deep Speech, while fully obfuscating the speakers' voice biometrics and allowing only a differentially private view of the textual content.
引用
收藏
页码:2703 / 2720
页数:18
相关论文
共 50 条
  • [1] Privacy-Preserving Speech Processing
    Pathak, Manas A.
    Raj, Bhiksha
    Rane, Shantanu
    Smaragdis, Paris
    IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (02) : 62 - 74
  • [2] A privacy-preserving ticketing system
    Verslype, Kristof
    De Decker, Bart
    NaessenS, Vincent
    Nigusse, Girma
    Lapon, Jorn
    Verhaeghe, Pieter
    DATA AND APPLICATIONS SECURITY XXII, 2008, 5094 : 97 - +
  • [3] Privacy-Preserving Speaker Verification and Speech Recognition
    Abbasi, Wisam
    EMERGING TECHNOLOGIES FOR AUTHORIZATION AND AUTHENTICATION, ETAA 2022, 2023, 13782 : 102 - 119
  • [4] Privacy-preserving Representation Learning for Speech Understanding
    Minh Tran
    Soleymani, Mohammad
    INTERSPEECH 2023, 2023, : 2858 - 2862
  • [5] Configurable Privacy-Preserving Automatic Speech Recognition
    Aloufi, Ranya
    Haddadi, Hamed
    Boyle, David
    INTERSPEECH 2021, 2021, : 861 - 865
  • [6] Towards Privacy-Preserving Speech Data Publishing
    Qian, Jianwei
    Han, Feng
    Hou, Jiahui
    Zhang, Chunhong
    Wang, Yu
    Li, Xiang-Yang
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2018), 2018, : 1088 - 1096
  • [7] PRIVACY-PRESERVING QUERY-BY-EXAMPLE SPEECH SEARCH
    Portelo, Jose
    Abad, Alberto
    Raj, Bhiksha
    Trancoso, Isabel
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1797 - 1801
  • [8] A Practical Privacy-Preserving Recommender System
    Badsha, Shahriar
    Yi, Xun
    Khalil, Ibrahim
    DATA SCIENCE AND ENGINEERING, 2016, 1 (03) : 161 - 177
  • [9] Privacy-Preserving Remote Knowledge System
    Dahlmanns, Markus
    Dax, Chris
    Matzutt, Roman
    Pennekamp, Jan
    Hiller, Jens
    Wehrle, Klaus
    2019 IEEE 27TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (IEEE ICNP), 2019,
  • [10] SRide: A Privacy-Preserving Ridesharing System
    Aivodji, Ulrich Matchi
    Huguenin, Kevin
    Huguet, Marie-Jose
    Killijian, Marc-Olivier
    WISEC'18: PROCEEDINGS OF THE 11TH ACM CONFERENCE ON SECURITY & PRIVACY IN WIRELESS AND MOBILE NETWORKS, 2018, : 40 - 50