Prεεch: A System for Privacy-Preserving Speech Transcription

被引:0
|
作者
Ahmed, Shimaa [1 ]
Chowdhury, Amrita Roy [1 ]
Fawaz, Kassem [1 ]
Ramanathan, Parmesh [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
SPEAKER;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
New advances in machine learning have made Automated Speech Recognition (AS R) systems practical and more scalable. These systems, however, pose serious privacy threats as speech is a rich source of sensitive acoustic and textual information. Although offline and open-source ASR eliminates the privacy risks, its transcription performance is inferior to that of cloud-based ASR systems, especially for real-world use cases. In this paper, we propose Pr epsilon epsilon ch, an end-to-end speech transcription system which lies at an intermediate point in the privacy-utility spectrum. It protects the acoustic features of the speakers' voices and protects the privacy of the textual content at an improved performance relative to offline ASR. Additionally, Pr epsilon epsilon ch provides several control knobs to allow customizable utility-usability-privacy trade-off. It relies on cloud-based services to transcribe a speech file after applying a series of privacy-preserving operations on the user's side. We perform a comprehensive evaluation of Pr epsilon epsilon ch, using diverse real-world datasets, that demonstrates its effectiveness. Pr epsilon epsilon ch provides transcription at a 2% to 32.25% (mean 17.34%) relative improvement in word error rate over Deep Speech, while fully obfuscating the speakers' voice biometrics and allowing only a differentially private view of the textual content.
引用
收藏
页码:2703 / 2720
页数:18
相关论文
共 50 条
  • [41] Privacy-Preserving Polling
    Shasha, Dennis
    COMMUNICATIONS OF THE ACM, 2020, 63 (10) : 108 - +
  • [42] Privacy-Preserving WSDM
    Korolova, Aleksandra
    PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 4 - 4
  • [43] Privacy-Preserving Statistics
    Vaidya, Jaideep
    COMPUTER, 2018, 51 (09) : 8 - 9
  • [44] Generating gender-ambiguous voices for privacy-preserving speech recognition
    Stoidis, Dimitrios
    Cavallaro, Andrea
    INTERSPEECH 2022, 2022, : 4237 - 4241
  • [45] A novel privacy-preserving speech recognition framework using bidirectional LSTM
    Qingren Wang
    Chuankai Feng
    Yan Xu
    Hong Zhong
    Victor S. Sheng
    Journal of Cloud Computing, 9
  • [46] A novel privacy-preserving speech recognition framework using bidirectional LSTM
    Wang, Qingren
    Feng, Chuankai
    Xu, Yan
    Zhong, Hong
    Sheng, Victor S.
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2020, 9 (01):
  • [47] PRIDE: A Privacy-Preserving Decentralised Key Management System
    Kester, David
    Li, Tianyu
    Erkin, Zekeriya
    2022 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2022,
  • [48] A Privacy-Preserving Mobile Payment System for Mass Transit
    Kang, Jeonil
    Nyang, DaeHun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (08) : 2192 - 2205
  • [49] An efficient privacy-preserving recommender system in wireless networks
    Luo, Junwei
    Yi, Xun
    Han, Fengling
    Yang, Xuechao
    WIRELESS NETWORKS, 2024, 30 (06) : 4949 - 4960
  • [50] RESEARCH ON CO-LOCATION PRIVACY-PRESERVING SYSTEM
    Li, Jiachun
    Xiong, Dongqing
    Cao, Jianzhou
    2018 14TH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN 2018), 2018, : 183 - 187