PHONETICS EMBEDDING LEARNING WITH SIDE INFORMATION

被引:0
|
作者
Synnaeve, Gabriel [1 ]
Schatz, Thomas [1 ,2 ]
Dupoux, Emmanuel [1 ]
机构
[1] CNRS, EHESS, IEC ENS, LSCP, Paris, France
[2] CNRS, ENS, SIERRA Project Team INRIA, Paris, France
关键词
speech; ABX; deep neural network; side information; semi-supervised; speech embeddings; acoustic model; DISCOVERY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show that it is possible to learn an efficient acoustic model using only a small amount of easily available word-level similarity annotations. In contrast to the detailed phonetic labeling required by classical speech recognition technologies, the only information our method requires are pairs of speech excerpts which are known to be similar (same word) and pairs of speech excerpts which are known to be different (different words). An acoustic model is obtained by training shallow and deep neural networks, using an architecture and a cost function well-adapted to the nature of the provided information. The resulting model is evaluated in an ABX minimalpair discrimination task and is shown to perform much better (11.8% ABX error rate) than raw speech features (19.6%), not far from a fully supervised baseline (best neural network: 9.2%, HMM-GMM: 11%).
引用
收藏
页码:106 / 111
页数:6
相关论文
共 50 条
  • [1] Information embedding with distortion side information
    Khisti, Ashish
    Martinian, Emin
    Wornell, Gregory
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 183 - +
  • [2] SINE: Side Information Network Embedding
    Chen, Zitai
    Cai, Tongzhao
    Chen, Chuan
    Zheng, Zibin
    Ling, Guohui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 692 - 708
  • [3] Embedding unstructured side information in product recommendation
    Pourgholamali, Fatemeh
    Kahani, Mohsen
    Bagheri, Ebrahim
    Noorian, Zeinab
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2017, 25 : 70 - 85
  • [4] Online Learning with Side Information
    Xu, Xiao
    Vakili, Sattar
    Zhao, Qing
    Swami, Ananthram
    MILCOM 2017 - 2017 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2017, : 303 - 308
  • [5] Learning Preferences with Side Information
    Farias, Vivek F.
    Li, Andrew A.
    MANAGEMENT SCIENCE, 2019, 65 (07) : 3131 - 3149
  • [6] Video Summarization by Learning Deep Side Semantic Embedding
    Yuan, Yitian
    Mei, Tao
    Cui, Peng
    Zhu, Wenwu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (01) : 226 - 237
  • [7] The duality between information embedding and source coding with side information and some applications
    Barron, RJ
    Chen, B
    Wornell, GW
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2003, 49 (05) : 1159 - 1180
  • [8] Learning with side information: PAC learning bounds
    Kuusela, P
    Ocone, D
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2004, 68 (03) : 521 - 545
  • [9] Learning Network Embedding with Community Structural Information
    Li, Yu
    Wang, Ying
    Zhang, Tingting
    Zhang, Jiawei
    Chang, Yi
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2937 - 2943
  • [10] Embedding Learning with Events in Heterogeneous Information Networks
    Gui, Huan
    Liu, Jialu
    Tao, Fangbo
    Jiang, Meng
    Norick, Brandon
    Kaplan, Lance
    Han, Jiawei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (11) : 2428 - 2441