Adversarial multi-task deep learning for signer-independent feature representation

被引:3
|
作者
Fang, Yuchun [1 ]
Xiao, Zhengye [1 ]
Cai, Sirui [1 ]
Ni, Lan [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Coll Liberal Arts, Shanghai 200444, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Sign language recognition; Multi-task learning; Deep learning;
D O I
10.1007/s10489-022-03649-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous research has achieved remarkable progress in Sign Language Recognition (SLR). However, for robust open-set SLR applications, it is necessary to solve signer-independent SLR. This paper proposes a novel adversarial multi-task deep learning (MTL) framework that can incorporate multiple modalities for isolated SLR. Employing the identity recognition task as the competition task to the target SLR task, the proposed model can effectively extract signer-independent features by deviating the optimization direction of the competitive task. Furthermore, the proposed adversarial MTL multi-modality framework can jointly incorporate positive and negative task learning with the target task. Combining multi-modality in the adversarial MTL, our model can extract robust signer-independent representation. We evaluate our method on multiple benchmark datasets from different sign languages. The experimental results demonstrate that the proposed adversarial MTL multi-modality model can effectively realize signer-independent SLR by compensation with relevant tasks and competition with irrelevant tasks.
引用
收藏
页码:4380 / 4392
页数:13
相关论文
共 50 条
  • [21] Unsupervised Human Activity Representation Learning with Multi-task Deep Clustering
    Ma, Haojie
    Zhang, Zhijie
    Li, Wenzhong
    Lu, Sanglu
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2021, 5 (01):
  • [22] Multi-task deep representation learning method for electronic health records
    Yang, Shan
    Zheng, Xiangwei
    Chen, Xuanchi
    Wei, Yi
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1188 - 1192
  • [23] Pose-robust and Discriminative Feature Representation by Multi-task Deep Learning for Multi-view Face Recognition
    Seo, Jeong-Jik
    Kim, Hyung-Il
    Ro, Yong Man
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 166 - 171
  • [24] Adversarial Multi-task Learning of Deep Neural Networks for Robust Speech Recognition
    Shinohara, Yusuke
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2369 - 2372
  • [25] Multi-task Learning Deep Neural Networks For Speech Feature Denoising
    Huang, Bin
    Ke, Dengfeng
    Zheng, Hao
    Xu, Bo
    Xu, Yanyan
    Su, Kaile
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2464 - 2468
  • [26] SEMANTICS CONSTRAINED DICTIONARY LEARNING FOR SIGNER-INDEPENDENT SIGN LANGUAGE RECOGNITION
    Yin, Fang
    Chai, Xiujuan
    Zhou, Yu
    Chen, Xilin
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3310 - 3314
  • [27] Learning Task Relational Structure for Multi-Task Feature Learning
    Wang, De
    Nie, Feiping
    Huang, Heng
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1239 - 1244
  • [28] Robust Feature Representation Using Multi-Task Learning for Human Activity Recognition
    Azadi, Behrooz
    Haslgruebler, Michael
    Anzengruber-Tanase, Bernhard
    Sopidis, Georgios
    Ferscha, Alois
    SENSORS, 2024, 24 (02)
  • [29] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141
  • [30] Multi-Stage Multi-Task Feature Learning
    Gong, Pinghua
    Ye, Jieping
    Zhang, Changshui
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 2979 - 3010