Skeleton-based Online Sign Language Recognition using Monotonic Attention

被引:0
|
作者
Takayama, Natsuki [1 ]
Benitez-Garcia, Gibran [1 ]
Takahashi, Hiroki [1 ,2 ]
机构
[1] Univ Electrocommun, Grad Sch Informat & Engn, Chofu, Tokyo, Japan
[2] Univ Electrocommun, Artificial Intelligence Explorat Res Ctr, Chofu, Tokyo, Japan
关键词
Monotonic Attention; Neural Networks; Skeleton-aced Sign Language Recognition;
D O I
10.5220/0010899400003124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequence-to-sequence models have been successfully applied to improve continuous sign language word recognition in recent years. Although various methods for continuous sign language word recognition have been proposed, these methods assume offline recognition and lack further investigation in online and streaming situations. In this study, skeleton-based continuous sign language word recognition for online situations was investigated. A combination of spatial-temporal graph convolutional networks and recurrent neural networks with soft attention was employed as the base model. Further, three types of monotonic attention techniques were applied to extend the base model for online recognition. The monotonic attention included hard monotonic attention, monotonic chunkwise attention, and monotonic infinite lookback attention. The performance of the proposed models was evaluated in offline and online recognition settings. A conventional Japanese sign language video dataset, including 275 types of isolated word videos and 113 types of sentence videos, was utilized to evaluate the proposed models. The results showed that the effectiveness of monotonic attention to online continuous sign language word recognition.
引用
收藏
页码:601 / 608
页数:8
相关论文
共 50 条
  • [11] Memory Attention Networks for Skeleton-Based Action Recognition
    Li, Ce
    Xie, Chunyu
    Zhang, Baochang
    Han, Jungong
    Zhen, Xiantong
    Chen, Jie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4800 - 4814
  • [12] Memory Attention Networks for Skeleton-based Action Recognition
    Xie, Chunyu
    Li, Ce
    Zhang, Baochang
    Chen, Chen
    Han, Jungong
    Liu, Jianzhuang
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1639 - 1645
  • [13] Bidirectional Skeleton-Based Isolated Sign Recognition using Graph Convolutional Networks
    Dafnis, Konstantinos M.
    Chroni, Evgenia
    Neidle, Carol
    Metaxas, Dimitris N.
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 7328 - 7338
  • [14] CoSign: Exploring Co-occurrence Signals in Skeleton-based Continuous Sign Language Recognition
    Jiao, Peiqi
    Min, Yuecong
    Li, Yanan
    Wang, Xiaotao
    Lei, Lei
    Chen, Xilin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20619 - 20629
  • [15] Improved skeleton-based activity recognition using convolutional block attention module
    Qin, Jing
    Zhang, Shugang
    Wang, Yiguo
    Yang, Fei
    Zhong, Xin
    Lu, Weigang
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
  • [16] Attention Relational Network for Skeleton-Based Group Activity Recognition
    Wang, Chuanchuan
    Mohamed, Ahmad Sufril Azlan
    IEEE ACCESS, 2023, 11 : 129230 - 129239
  • [17] Sequence Segmentation Attention Network for Skeleton-Based Action Recognition
    Zhang, Yujie
    Cai, Haibin
    ELECTRONICS, 2023, 12 (07)
  • [18] Skeleton-Based Attention Mask for Pedestrian Attribute Recognition Network
    Sooksatra, Sorn
    Rujikietgumjorn, Sitapa
    JOURNAL OF IMAGING, 2021, 7 (12)
  • [19] Skeleton-based Chinese sign language recognition and generation for bidirectional communication between deaf and hearing people
    Xiao, Qinkun
    Qin, Minying
    Yin, Yuting
    NEURAL NETWORKS, 2020, 125 : 41 - 55
  • [20] Spatio-temporal segments attention for skeleton-based action recognition
    Qiu, Helei
    Hou, Biao
    Ren, Bo
    Zhang, Xiaohua
    NEUROCOMPUTING, 2023, 518 : 30 - 38