A ROBUST SPEAKER CLUSTERING METHOD BASED ON DISCRETE TIED VARIATIONAL AUTOENCODER

被引:0
|
作者
Feng, Chen [1 ]
Wang, Jianzong [1 ]
Li, Tongxu [1 ]
Peng, Junqing [1 ]
Xiao, Jing [1 ]
机构
[1] Ping An Technol Shenzhen Co Ltd, Shenzhen, Guangdong, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
speaker clustering; tied variational autoencoder; mutual information; aggregation hierarchy cluster;
D O I
10.1109/icassp40776.2020.9053488
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, the speaker clustering model based on aggregation hierarchy cluster (AHC) is a common method to solve two main problems: no preset category number clustering and fix category number clustering. In general, model takes features like i-vectors as input of probability and linear discriminant analysis model (PLDA) aims to form the distance matric in long voice application scenario, and then clustering results are obtained through the clustering model. However, traditional speaker clustering method based on AHC has the shortcomings of long-time running and remains sensitive to environment noise. In this paper, we propose a novel speaker clustering method based on Mutual Information (MI) and a non-linear model with discrete variable, which under the enlightenment of Tied Variational Autoencoder (TVAE), to enhance the robustness against noise. The proposed method named Discrete Tied Variational Autoencoder (DTVAE) which shortens the elapsed time substantially. With experience results, it outperforms the general model and yields a relative Accuracy (ACC) improvement and significant time reduction.
引用
收藏
页码:6024 / 6028
页数:5
相关论文
共 50 条
  • [31] A generative design method of airfoil based on conditional variational autoencoder
    Wang, Xu
    Qian, Weiqi
    Zhao, Tun
    Chen, Hai
    He, Lei
    Sun, Haisheng
    Tian, Yuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [32] A Network Data Reinforcement Method Based on the Multiclass Variational Autoencoder
    Qu, Yanze
    Ma, Hailong
    Jiang, Yiming
    Wang, Liang
    Yu, Jing
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [33] A boosting resampling method for regression based on a conditional variational autoencoder
    Huang, Yang
    Liu, Duen-Ren
    Lee, Shin-Jye
    Hsu, Chia-Hao
    Liu, Yang-Guang
    INFORMATION SCIENCES, 2022, 590 : 90 - 105
  • [34] A Data Reconstruction Method based on Adversarial Conditional Variational Autoencoder
    Ren, Yifu
    Liu, Jinhai
    Zhang, Jianan
    Jiang, Lin
    Luo, Yanhong
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 622 - 626
  • [35] Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations
    Yang, Lin
    Fan, Wentao
    Bouguila, Nizar
    KNOWLEDGE-BASED SYSTEMS, 2022, 246
  • [36] ADSVAE: An Adaptive Density-aware Spectral Clustering Method for Multi-omics Data Based on Variational Autoencoder
    Zhao, Jianping
    Guan, Qi
    Zheng, Chunhou
    Cao, Qingqing
    CURRENT BIOINFORMATICS, 2023, 18 (06) : 527 - 536
  • [37] The discrete variational derivative method based on discrete differential forms
    Yaguchi, Takaharu
    Matsuo, Takayasu
    Sugihara, Masaaki
    JOURNAL OF COMPUTATIONAL PHYSICS, 2012, 231 (10) : 3963 - 3986
  • [38] Unified Robust Semi-Supervised Variational Autoencoder
    Chen, Xu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [39] FREPD: A Robust Federated Learning Framework on Variational Autoencoder
    Gu, Zhipin
    He, Liangzhong
    Li, Peiyan
    Sun, Peng
    Shi, Jiangyong
    Yang, Yuexiang
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 39 (03): : 307 - 320
  • [40] Multimode Process Monitoring Based on Modified Density Peak Clustering and Parallel Variational Autoencoder
    Yu, Feng
    Liu, Jianchang
    Liu, Dongming
    MATHEMATICS, 2022, 10 (14)