Combining Self-supervised Learning and Adversarial Training based Domain Adaptation for Speaker Verification

被引:0
|
作者
Chen, Zhengyang [1 ]
Wang, Shuai [2 ,3 ]
Han, Bing [1 ]
Qian, Yanmin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Auditory Cognit & Computat Acoust Lab, MoE Key Lab Artificial Intelligence,AI Inst, Shanghai, Peoples R China
[2] Shenzhen Res Inst Big Data, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Shenzhen, Peoples R China
关键词
speech verification; domain adaptation; adversarial training; self-supervised learning;
D O I
10.1109/ISCSLP63861.2024.10800283
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adapting an existing well-trained system to a new domain using only unlabeled data is a highly sought-after yet challenging task for speaker verification in real-world scenarios. In this paper, we study two different domain adaptation methods, the adversarial domain adaptation (ADA) and the self-supervised learning-based domain adaptation (SSDA). To facilitate the deployment of unsupervised adaptation methods in applications, we conduct a detailed analysis of the characteristics of both the ADA and SSDA adaptation strategies. Our findings indicate that the SSDA strategy's performance is highly influenced by the amount of target domain data, whereas the ADA strategy is relatively insensitive to data quantity. Furthermore, augmenting target domain data enhances SSDA system performance but diminishes ADA performance. To further enhance system performance, we explore the complementarity between ADA and SSDA. Our results demonstrate that ADA and SSDA complement each other. When both strategies are applied jointly, the best system achieves over 20.0% relative Equal Error Rate (EER) improvement on the Cnceleb evaluation set and over 35.0% relative average EER improvement on the SRE16 Cantonese and Tagalog evaluation set under domain mismatched conditions.
引用
收藏
页码:701 / 705
页数:5
相关论文
共 50 条
  • [1] SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION
    Chen, Zhengyang
    Wang, Shuai
    Qian, Yanmin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5834 - 5838
  • [2] Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
    Wu, Haibin
    Li, Xu
    Liu, Andy T.
    Wu, Zhiyong
    Meng, Helen
    Lee, Hung-Yi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 202 - 217
  • [3] Augmentation Adversarial Training for Self-Supervised Speaker Representation Learning
    Kang, Jingu
    Huh, Jaesung
    Heo, Hee Soo
    Chung, Joon Son
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1253 - 1262
  • [4] ADVERSARIAL DEFENSE FOR AUTOMATIC SPEAKER VERIFICATION BY CASCADED SELF-SUPERVISED LEARNING MODELS
    Wu, Haibin
    Li, Xu
    Liu, Andy T.
    Wu, Zhiyong
    Meng, Helen
    Lee, Hung-yi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6718 - 6722
  • [5] Self-supervised learning based domain regularization for mask-wearing speaker verification
    Zhang, Ruiteng
    Wei, Jianguo
    Lu, Xugang
    Lu, Wenhuan
    Jin, Di
    Zhang, Lin
    Ji, Yantao
    Xu, Junhai
    SPEECH COMMUNICATION, 2023, 152
  • [6] Curriculum learning for self-supervised speaker verification
    Heo, Hee-Soo
    Jung, Jee-weon
    Kang, Jingu
    Kwon, Youngki
    Kim, You Jin
    Lee, Bong-Jin
    Chung, Joon Son
    INTERSPEECH 2023, 2023, : 4693 - 4697
  • [7] Self-Supervised Adversarial Learning for Domain Adaptation of Pavement Distress Classification
    Wu, Yanwen
    Hong, Mingjian
    Li, Ao
    Huang, Sheng
    Liu, Huijun
    Ge, Yongxin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1966 - 1977
  • [8] ROBUST SPEAKER VERIFICATION WITH JOINT SELF-SUPERVISED AND SUPERVISED LEARNING
    Wang, Kai
    Zhang, Xiaolei
    Zhang, Miao
    Li, Yuguang
    Lee, Jaeyun
    Cho, Kiho
    Park, Sung-UN
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7637 - 7641
  • [9] Self-Supervised Domain Adaptation with Consistency Training
    Xiao, Liang
    Xu, Jiaolong
    Zhao, Dawei
    Wang, Zhiyu
    Wang, Li
    Nie, Yiming
    Dai, Bin
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6874 - 6880
  • [10] SELF-SUPERVISED ADVERSARIAL TRAINING
    Chen, Kejiang
    Chen, Yuefeng
    Zhou, Hang
    Mao, Xiaofeng
    Li, Yuhong
    He, Yuan
    Xue, Hui
    Zhang, Weiming
    Yu, Nenghai
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2218 - 2222