PHONETICS EMBEDDING LEARNING WITH SIDE INFORMATION

被引:0
|
作者
Synnaeve, Gabriel [1 ]
Schatz, Thomas [1 ,2 ]
Dupoux, Emmanuel [1 ]
机构
[1] CNRS, EHESS, IEC ENS, LSCP, Paris, France
[2] CNRS, ENS, SIERRA Project Team INRIA, Paris, France
关键词
speech; ABX; deep neural network; side information; semi-supervised; speech embeddings; acoustic model; DISCOVERY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show that it is possible to learn an efficient acoustic model using only a small amount of easily available word-level similarity annotations. In contrast to the detailed phonetic labeling required by classical speech recognition technologies, the only information our method requires are pairs of speech excerpts which are known to be similar (same word) and pairs of speech excerpts which are known to be different (different words). An acoustic model is obtained by training shallow and deep neural networks, using an architecture and a cost function well-adapted to the nature of the provided information. The resulting model is evaluated in an ABX minimalpair discrimination task and is shown to perform much better (11.8% ABX error rate) than raw speech features (19.6%), not far from a fully supervised baseline (best neural network: 9.2%, HMM-GMM: 11%).
引用
收藏
页码:106 / 111
页数:6
相关论文
共 50 条
  • [21] GSIRec: Learning with graph side information for recommendation
    Anchen Li
    Bo Yang
    World Wide Web, 2021, 24 : 1411 - 1437
  • [22] Learning to Augment with Feature Side-information
    Mollaysa, Amina
    Kalousis, Alexandros
    Bruno, Eric
    Diephuis, Maurits
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 173 - 187
  • [23] Expedited Online Learning With Spatial Side Information
    Thangeda, Pranay
    Ornik, Melkior
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1479 - 1491
  • [24] Partial label learning with noisy side information
    Wang, Shaokai
    Xia, Mingxuan
    Wang, Zilong
    Lyu, Gengyu
    Feng, Songhe
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12382 - 12396
  • [25] Learning with Side Information through Modality Hallucination
    Hoffman, Judy
    Gupta, Saurabh
    Darrell, Trevor
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 826 - 834
  • [26] GSIRec: Learning with graph side information for recommendation
    Li, Anchen
    Yang, Bo
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (05): : 1411 - 1437
  • [27] Structure Learning with Side Information: Sample Complexity
    Sihag, Saurabh
    Tajer, Ali
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] Hypergraph contrastive learning for recommendation with side information
    Ao, Dun
    Cao, Qian
    Wang, Xiaofeng
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2024, 17 (04) : 657 - 670
  • [29] Efficiently learning the metric with side-information
    De Bie, T
    Momma, M
    Cristianini, N
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2003, 2842 : 175 - 189
  • [30] Output Regularized Metric Learning with Side Information
    Liu, Wei
    Hoi, Steven C. H.
    Liu, Jianzhuang
    COMPUTER VISION - ECCV 2008, PT III, PROCEEDINGS, 2008, 5304 : 358 - +