Generalizable Low-Resource Activity Recognition with Diverse and Discriminative Representation Learning

Cited by: 5
Authors
Qin, Xin [1 ]
Wang, Jindong [2 ]
Ma, Shuo [1 ]
Lu, Wang [1 ]
Zhu, Yongchun [1 ]
Xie, Xing [2 ]
Chen, Yiqiang [1 ]
Affiliations
[1] Chinese Acad Sci, Beijing Key Lab Mobile Com, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
Keywords
Human Activity Recognition; Domain Generalization; Low-Resource
DOI
10.1145/3580305.3599360
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Human activity recognition (HAR) is a time series classification task that focuses on identifying motion patterns from human sensor readings. Adequate data is essential, yet a major bottleneck, for training a generalizable HAR model, which assists customization and optimization of online web applications. In practice, collecting large-scale labeled data is costly in both time and money, i.e., the low-resource challenge. Meanwhile, data collected from different persons exhibit distribution shifts due to different living habits, body shapes, age groups, etc. The low-resource and distribution shift challenges are detrimental to HAR when applying the trained model to new unseen subjects. In this paper, we propose a novel approach called Diverse and Discriminative representation Learning (DDLearn) for generalizable low-resource HAR. DDLearn simultaneously considers diversity and discrimination learning. With the constructed self-supervised learning task, DDLearn enlarges the data diversity and explores the latent activity properties. Then, we propose a diversity preservation module to preserve the diversity of learned features by enlarging the distribution divergence between the original and augmented domains. Meanwhile, DDLearn also enhances semantic discrimination by learning discriminative representations with supervised contrastive learning. Extensive experiments on three public HAR datasets demonstrate that our method significantly outperforms state-of-the-art methods, with an average accuracy improvement of 9.5% under the low-resource distribution shift scenarios, while being a generic, explainable, and flexible framework. Code is available at: https://github.com/microsoft/robustlearn.
Pages: 1943-1953
Page count: 11
Related papers
50 in total
  • [31] Enrollment in low-resource speech recognition systems
    Deligne, S
    Dharanipragada, S
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 341 - 344
  • [32] Image-Mediated Data Augmentation for Low-Resource Human Activity Recognition
    Wang, Zihao
    Qu, Youli
    Tao, Junru
    Song, Yudan
    PROCEEDINGS OF THE 2019 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTE AND DATA ANALYSIS (ICCDA 2019), 2019, : 49 - 54
  • [33] EXPLORING SELF-SUPERVISED REPRESENTATION LEARNING FOR LOW-RESOURCE MEDICAL IMAGE ANALYSIS
    Chattopadhyay, Soumitri
    Ganguly, Soham
    Chaudhury, Sreejit
    Nag, Sayan
    Chattopadhyay, Samiran
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1440 - 1444
  • [34] Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition
    Feng, Siyuan
    Tu, Ming
    Xia, Rui
    Huang, Chuanzeng
    Wang, Yuxuan
    INTERSPEECH 2023, 2023, : 1384 - 1388
  • [35] Towards Discriminative Representation Learning for Speech Emotion Recognition
    Li, Runnan
    Wu, Zhiyong
    Jia, Jia
    Bu, Yaohua
    Zhao, Sheng
    Meng, Helen
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5060 - 5066
  • [36] Erratum to: Latent discriminative representation learning for speaker recognition
    Huang, Duolin
    Mao, Qirong
    Ma, Zhongchen
    Zheng, Zhishen
    Routray, Sidheswar
    Ocquaye, Elias-Nii-Noi
    Frontiers of Information Technology & Electronic Engineering, 2021, 22 : 914 - 914
  • [37] Sparse Representation for Face Recognition based on Discriminative Low-Rank Dictionary Learning
    Ma, Long
    Wang, Chunheng
    Xiao, Baihua
    Zhou, Wen
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2586 - 2593
  • [38] DEEP MAXOUT NETWORKS FOR LOW-RESOURCE SPEECH RECOGNITION
    Miao, Yajie
    Metze, Florian
    Rawat, Shourabh
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 398 - 403
  • [39] Speech recognition datasets for low-resource Congolese languages
    Kimanuka, Ussen
    Maina, Ciira wa
    Buyuk, Osman
    DATA IN BRIEF, 2024, 52
  • [40] Meta Learning for Low-Resource Molecular Optimization
    Wang, Jiahao
    Zheng, Shuangjia
    Chen, Jianwen
    Yang, Yuedong
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (04) : 1627 - 1636