Skeleton-based Online Sign Language Recognition using Monotonic Attention

被引:0
|
作者
Takayama, Natsuki [1 ]
Benitez-Garcia, Gibran [1 ]
Takahashi, Hiroki [1 ,2 ]
机构
[1] Univ Electrocommun, Grad Sch Informat & Engn, Chofu, Tokyo, Japan
[2] Univ Electrocommun, Artificial Intelligence Explorat Res Ctr, Chofu, Tokyo, Japan
关键词
Monotonic Attention; Neural Networks; Skeleton-aced Sign Language Recognition;
D O I
10.5220/0010899400003124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequence-to-sequence models have been successfully applied to improve continuous sign language word recognition in recent years. Although various methods for continuous sign language word recognition have been proposed, these methods assume offline recognition and lack further investigation in online and streaming situations. In this study, skeleton-based continuous sign language word recognition for online situations was investigated. A combination of spatial-temporal graph convolutional networks and recurrent neural networks with soft attention was employed as the base model. Further, three types of monotonic attention techniques were applied to extend the base model for online recognition. The monotonic attention included hard monotonic attention, monotonic chunkwise attention, and monotonic infinite lookback attention. The performance of the proposed models was evaluated in offline and online recognition settings. A conventional Japanese sign language video dataset, including 275 types of isolated word videos and 113 types of sentence videos, was utilized to evaluate the proposed models. The results showed that the effectiveness of monotonic attention to online continuous sign language word recognition.
引用
收藏
页码:601 / 608
页数:8
相关论文
共 50 条
  • [41] View transform graph attention recurrent networks for skeleton-based action recognition
    Qingqing Huang
    Fengyu Zhou
    Runze Qin
    Yang zhao
    Signal, Image and Video Processing, 2021, 15 : 599 - 606
  • [42] View transform graph attention recurrent networks for skeleton-based action recognition
    Huang, Qingqing
    Zhou, Fengyu
    Qin, Runze
    Zhao, Yang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (03) : 599 - 606
  • [43] Graph transformer network with temporal kernel attention for skeleton-based action recognition
    Liu, Yanan
    Zhang, Hao
    Xu, Dan
    He, Kangjian
    KNOWLEDGE-BASED SYSTEMS, 2022, 240
  • [44] Revisiting Skeleton-based Action Recognition
    Duan, Haodong
    Zhao, Yue
    Chen, Kai
    Lin, Dahua
    Dai, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
  • [45] Skeleton-based Action Recognition for Human-Robot Interaction using Self-Attention Mechanism
    Bandi, Chaitanya
    Thomas, Ulrike
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [46] Skeleton-based Human Action Recognition using Basis Vectors
    Asteriadis, Stylianos
    Daras, Petros
    8TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2015), 2015,
  • [47] Language-guided temporal primitive modeling for skeleton-based action recognition
    Pan, Qingzhe
    Xie, Xuemei
    NEUROCOMPUTING, 2025, 613
  • [48] DHF-SLR: Dual-Hand Multi-Stream Fusion Network for Skeleton-Based Sign Language Recognition
    Zhang, Meiqi
    Gao, Qing
    Ju, Zhaojie
    2024 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS, ICARM 2024, 2024, : 649 - 654
  • [49] Hierarchical graph attention network with pseudo-metapath for skeleton-based action recognition
    Wang, Mingdao
    Li, XueMing
    Zhang, Xianlin
    Zhang, Yue
    NEUROCOMPUTING, 2022, 501 : 822 - 833
  • [50] Attention-Based Generative Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Yang, Kai
    Ding, Xiaolu
    Chen, Wai
    ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 1 - 6