SELF-SUPERVISED SPEAKER RECOGNITION WITH LOSS-GATED LEARNING

被引:21
|
作者
Tao, Ruijie [1 ]
Lee, Kong Aik [2 ]
Das, Rohan Kumar [3 ]
Hautamaki, Ville [1 ,4 ]
Li, Haizhou [1 ,5 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[2] ASTAR, Inst Infocomm Res, Singapore, Singapore
[3] Fortemedia Singapore, Singapore, Singapore
[4] Univ Eastern Finland, Kuopio, Finland
[5] Chinese Univ Hong Kong, Shenzhen, Peoples R China
基金
新加坡国家研究基金会;
关键词
self-supervised speaker recognition; pseudo label selection; loss-gated learning;
D O I
10.1109/ICASSP43922.2022.9747162
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In self-supervised learning for speaker recognition, pseudo labels are useful as the supervision signals. It is a known fact that a speaker recognition model doesn't always benefit from pseudo labels due to their unreliability. In this work, we observe that a speaker recognition network tends to model the data with reliable labels faster than those with unreliable labels. This motivates us to study a loss-gated learning (LGL) strategy, which extracts the reliable labels through the fitting ability of the neural network during training With the proposed LGL, our speaker recognition model obtains a 46.3% performance gain over the system without it. Further, the proposed self-supervised speaker recognition with LGL trained on the VoxCeleb2 dataset without any labels achieves an equal error rate of 1.66% on the VoxCelebl original test set.
引用
收藏
页码:6142 / 6146
页数:5
相关论文
共 50 条
  • [31] Self-supervised representation learning for surgical activity recognition
    Daniel Paysan
    Luis Haug
    Michael Bajka
    Markus Oelhafen
    Joachim M. Buhmann
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 2037 - 2044
  • [32] SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION
    Chen, Zhengyang
    Wang, Shuai
    Qian, Yanmin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5834 - 5838
  • [33] CONTRASTIVE SELF-SUPERVISED LEARNING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Zhang, Haoran
    Zou, Yuexian
    Wang, Helin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6713 - 6717
  • [34] Self-Supervised Representation Learning With Path Integral Clustering for Speaker Diarization
    Singh, Prachi
    Ganapathy, Sriram
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1639 - 1649
  • [35] Contrastive Self-Supervised Learning for Optical Music Recognition
    Penarrubia, Carlos
    Valero-Mas, Jose J.
    Calvo-Zaragoza, Jorge
    DOCUMENT ANALYSIS SYSTEMS, DAS 2024, 2024, 14994 : 312 - 326
  • [36] Weather Recognition Using Self-supervised Deep Learning
    Acuna-Escobar, Diego
    Intriago-Pazmino, Monserrate
    Ibarra-Fiallo, Julio
    SMART TECHNOLOGIES, SYSTEMS AND APPLICATIONS, SMARTTECH-IC 2021, 2022, 1532 : 161 - 174
  • [37] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
    Truong, Duc-Tuan
    Anh, Tran The
    Siong, Chng Eng
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1950 - 1955
  • [38] Learning the Relation Between Similarity Loss and Clustering Loss in Self-Supervised Learning
    Ge, Jidong
    Liu, Yuxiang
    Gui, Jie
    Fang, Lanting
    Lin, Ming
    Kwok, James Tin-Yau
    Huang, Liguo
    Luo, Bin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3442 - 3454
  • [39] SELF-SUPERVISED SPEAKER RECOGNITION TRAINING USING HUMAN-MACHINE DIALOGUES
    Cekic, Metehan
    Li, Ruirui
    Chen, Zeya
    Yang, Yuguang
    Stolcke, Andreas
    Madhow, Upamanyu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6132 - 6136
  • [40] Gated Dual Hypergraph Convolutional Networks for Recommendation with Self-supervised Learning
    Gao, Rong
    Liu, Jiakang
    Yu, Yonghong
    Liu, Donghua
    Shao, XiongKai
    Ye, Zhiwei
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,