Additive Phoneme-aware Margin Softmax Loss for Language Recognition

Cited by: 4
Authors
Li, Zheng [1 ]
Liu, Yan [1 ]
Li, Lin [1 ]
Hong, Qingyang [2 ]
Affiliations
[1] Xiamen Univ, Sch Elect Sci & Engn, Xiamen, Peoples R China
[2] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
language recognition; oriental language recognition; margin loss; phonetic information; SPEAKER;
DOI
10.21437/Interspeech.2021-1167
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
This paper proposes an additive phoneme-aware margin softmax (APM-Softmax) loss to train a multi-task learning network with phonetic information for language recognition. In additive margin softmax (AM-Softmax) loss, the margin is held constant throughout training for all training samples, which is suboptimal because recognition difficulty varies across training samples. In additive angular margin softmax (AAM-Softmax) loss, the additional angular margin is likewise set as a constant. In this paper, we propose an APM-Softmax loss for language recognition with phonetic multi-task learning, in which the additive phoneme-aware margin is automatically tuned for different training samples. More specifically, the margin for language recognition is adjusted according to the results of phoneme recognition. Experiments are reported on Oriental Language Recognition (OLR) datasets, and the proposed method improves on AM-Softmax loss and AAM-Softmax loss in different language recognition testing conditions.
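The difference between a constant margin and a per-sample margin can be illustrated with a minimal NumPy sketch. This is an assumption-laden illustration, not the authors' implementation: the function name `margin_softmax_loss`, the `scale` value, and the example margin values are hypothetical, and the abstract does not state how phoneme recognition results are mapped to a margin, so the per-sample margins are simply passed in as an input.

```python
import numpy as np

def margin_softmax_loss(embeddings, weights, labels, margins, scale=30.0):
    """Mean AM-Softmax-style loss with an additive cosine margin.

    `margins` may be a scalar (constant margin, as in AM-Softmax) or an
    array of shape (batch,) holding one margin per training sample.
    """
    # L2-normalise embeddings and class weights so the logits are cosines.
    emb = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    w = weights / np.linalg.norm(weights, axis=0, keepdims=True)
    logits = emb @ w                      # shape: (batch, n_classes)
    rows = np.arange(len(labels))
    logits[rows, labels] -= margins       # subtract margin from target cosine only
    logits *= scale
    # Numerically stable log-softmax.
    logits -= logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_prob[rows, labels].mean()

# Toy data: 4 samples, 8-dimensional embeddings, 3 language classes.
rng = np.random.default_rng(0)
emb = rng.normal(size=(4, 8))
w = rng.normal(size=(8, 3))
labels = np.array([0, 2, 1, 0])

# Constant margin for every sample.
constant = margin_softmax_loss(emb, w, labels, margins=0.2)
# Hypothetical per-sample margins; in the paper these would be derived
# from phoneme recognition results by a rule not given in the abstract.
per_sample = margin_softmax_loss(
    emb, w, labels, margins=np.array([0.1, 0.3, 0.2, 0.4])
)
```

Passing a scalar reproduces the constant-margin case, while passing an array lets each sample carry its own margin, which is the general idea the abstract describes.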
Pages: 3276-3280
Number of pages: 5