A ROBUST SPEAKER CLUSTERING METHOD BASED ON DISCRETE TIED VARIATIONAL AUTOENCODER

Cited by: 0
Authors
Feng, Chen [1 ]
Wang, Jianzong [1 ]
Li, Tongxu [1 ]
Peng, Junqing [1 ]
Xiao, Jing [1 ]
Affiliations
[1] Ping An Technol Shenzhen Co Ltd, Shenzhen, Guangdong, Peoples R China
Source
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020
Keywords
speaker clustering; tied variational autoencoder; mutual information; aggregation hierarchy cluster;
DOI
10.1109/icassp40776.2020.9053488
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Speaker clustering models based on agglomerative hierarchical clustering (AHC) are a common way to address two main problems: clustering without a preset number of categories and clustering with a fixed number of categories. In long-utterance application scenarios, such a model typically takes features such as i-vectors as input to a probabilistic linear discriminant analysis (PLDA) model, which produces the distance matrix, and the clustering model then derives the clustering results from that matrix. However, traditional AHC-based speaker clustering has a long running time and remains sensitive to environmental noise. In this paper, we propose a novel speaker clustering method based on Mutual Information (MI) and a non-linear model with a discrete variable, inspired by the Tied Variational Autoencoder (TVAE), to enhance robustness against noise. The proposed method, named Discrete Tied Variational Autoencoder (DTVAE), also shortens the elapsed time substantially. Experimental results show that it outperforms the general model, yielding a relative accuracy (ACC) improvement and a significant reduction in running time.
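The AHC baseline that the abstract describes can be illustrated with a minimal sketch. It is not the DTVAE method and not the authors' code. It assumes cosine distance as a stand-in for PLDA scoring, average linkage as the merge rule, and toy 2-D vectors in place of i-vectors. The names `cosine_distance`, `ahc`, and `embeddings` are hypothetical. The `n_clusters` argument corresponds to the fixed-category-number case and `threshold` to the case with no preset number of categories.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two nonzero vectors (assumed stand-in for PLDA scoring)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def ahc(vectors, n_clusters=None, threshold=None):
    """Average-linkage agglomerative clustering over a pairwise distance matrix.

    Pass n_clusters to stop at a fixed number of clusters, or threshold to
    stop once the closest pair of clusters is farther apart than threshold.
    Returns one integer cluster label per input vector.
    """
    if (n_clusters is None) == (threshold is None):
        raise ValueError("pass exactly one of n_clusters or threshold")

    n = len(vectors)
    # Pairwise distance matrix, computed once up front.
    dist = [[cosine_distance(vectors[i], vectors[j]) for j in range(n)]
            for i in range(n)]
    clusters = [[i] for i in range(n)]  # every vector starts as its own cluster

    def linkage(c1, c2):
        # Average of all cross-cluster pairwise distances.
        return sum(dist[i][j] for i in c1 for j in c2) / (len(c1) * len(c2))

    while len(clusters) > 1:
        if n_clusters is not None and len(clusters) <= n_clusters:
            break
        # Find the closest pair of clusters.
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = linkage(clusters[a], clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        if threshold is not None and d > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]

    labels = [0] * n
    for k, members in enumerate(clusters):
        for i in members:
            labels[i] = k
    return labels


# Toy embeddings (hypothetical): two pairs of similar vectors and one outlier.
embeddings = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9], [-1.0, 0.1]]

fixed = ahc(embeddings, n_clusters=2)   # fixed number of categories
auto = ahc(embeddings, threshold=0.5)   # no preset number of categories
print(fixed)
print(auto)
```

The search for the closest pair is repeated from scratch at every merge, which keeps the sketch short but is slow on large inputs; that cost is the kind of long running time the abstract attributes to AHC-based clustering.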
Pages: 6024 - 6028
Page count: 5
Related papers
50 in total
  • [41] Robust deep image clustering using convolutional autoencoder with separable discrete Krawtchouk and Hahn orthogonal moments
    Bouali, Aymane
    El Ouariachi, Ilham
    Zahi, Azeddine
    Zenkouar, Khalid
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 22
  • [42] A robust unsupervised speaker clustering of speech utterances
    Zhang, SL
    Zhang, SW
    Xu, B
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 115 - 120
  • [43] Robust Speaker Clustering Using Affinity Propagation
    Zhang, Xiang
    Lu, Ping
    Suo, Hongbin
    Zhao, Qingwei
    Yan, Yonghong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (11): : 2739 - 2741
  • [44] Variational Autoencoder feature clustering for tissue classification in robotic palpation
    Urrutia, Robin
    Espejo, Diego
    Sühn, Thomas
    Guerra, Montserrat
    Fuentealba, Patricio
    Poblete, Victor
    Boese, Axel
    Illanes, Alfredo
    Current Directions in Biomedical Engineering, 2024, 10 (01) : 89 - 92
  • [45] A Method for Constructing Supervised Time Topic Model Based on Variational Autoencoder
    Gou, Zhinan
    Li, Yan
    Huo, Zheng
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [46] DefVAE: A Defect Detection Method for Catenary Devices Based on Variational Autoencoder
    Lu, Tengfei
    Wang, Zhongli
    Shen, Yan
    Shao, Xiaotao
    Tang, Yonglin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [47] A denoising method of ECG signal based on variational autoencoder and masked convolution
    Xia, Yinghao
    Chen, Changfang
    Shu, Minglei
    Liu, Ruixia
    JOURNAL OF ELECTROCARDIOLOGY, 2023, 80 : 81 - 90
  • [48] Multi-Decoder RNN Autoencoder Based on Variational Bayes Method
    Kaji, Daisuke
    Watanabe, Kazuho
    Kobayashi, Masahiro
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [49] Robust AUV Visual Loop-Closure Detection Based on Variational Autoencoder Network
    Wang, Yangyang
    Ma, Xiaorui
    Wang, Jie
    Hou, Shilong
    Dai, Ju
    Gu, Dongbing
    Wang, Hongyu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (12) : 8829 - 8838
  • [50] A modified speaker clustering method for efficient speaker identification
    Yan, JiaChang
    Wang, Lei
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,