Skeleton-based Online Sign Language Recognition using Monotonic Attention

Citations: 0
Authors
Takayama, Natsuki [1]
Benitez-Garcia, Gibran [1]
Takahashi, Hiroki [1, 2]
Affiliations
[1] Univ Electrocommun, Grad Sch Informat & Engn, Chofu, Tokyo, Japan
[2] Univ Electrocommun, Artificial Intelligence Explorat Res Ctr, Chofu, Tokyo, Japan
Keywords
Monotonic Attention; Neural Networks; Skeleton-based Sign Language Recognition
DOI
10.5220/0010899400003124
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sequence-to-sequence models have been successfully applied to improve continuous sign language word recognition in recent years. Although various methods for continuous sign language word recognition have been proposed, they assume offline recognition and have not been investigated in online and streaming situations. In this study, skeleton-based continuous sign language word recognition for online situations was investigated. A combination of spatial-temporal graph convolutional networks and recurrent neural networks with soft attention was employed as the base model. Three types of monotonic attention were then applied to extend the base model for online recognition: hard monotonic attention, monotonic chunkwise attention, and monotonic infinite lookback attention. The performance of the proposed models was evaluated in offline and online recognition settings. A conventional Japanese sign language video dataset, including 275 types of isolated word videos and 113 types of sentence videos, was utilized to evaluate the proposed models. The results showed the effectiveness of monotonic attention for online continuous sign language word recognition.
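The abstract names hard monotonic attention but does not describe its mechanics. As a rough illustration only, and not the authors' implementation, the sketch below shows the usual inference-time idea: for each output step, the input frames are scanned left to right from the previously attended frame, and the scan stops at the first frame whose selection probability reaches a threshold. The function names, the precomputed energy table, and the 0.5 threshold are all assumptions made for this example; in a real model the energies would depend on the decoder state, and a step with no selected frame would typically receive a zero context vector instead of ending decoding.

```python
import math


def sigmoid(x):
    """Logistic function mapping an attention energy to a selection probability."""
    return 1.0 / (1.0 + math.exp(-x))


def hard_monotonic_attend(energies, start=0, threshold=0.5):
    """Scan input frames left to right, beginning at `start`.

    Returns the index of the first frame whose selection probability
    reaches `threshold`, or None if no frame is selected.
    """
    for j in range(start, len(energies)):
        if sigmoid(energies[j]) >= threshold:
            return j
    return None


def decode_positions(energy_rows, threshold=0.5):
    """Return the attended frame index for each output step.

    energy_rows[i][j] is a hypothetical, precomputed energy for output
    step i and input frame j. Each step resumes scanning from the frame
    attended at the previous step, so positions never move backwards.
    Simplification: decoding stops once a step selects no frame.
    """
    positions = []
    start = 0
    for row in energy_rows:
        j = hard_monotonic_attend(row, start, threshold)
        if j is None:
            break
        positions.append(j)
        start = j  # the next step may attend to the same frame again
    return positions


if __name__ == "__main__":
    toy_energies = [
        [-2.0, 3.0, -1.0, -1.0],
        [-2.0, -3.0, -1.0, 4.0],
        [-5.0, -5.0, -5.0, 2.0],
    ]
    print(decode_positions(toy_energies))
```

Because each step only needs frames up to the selected one, this style of attention can run while input is still arriving, which is the property that makes it a candidate for online recognition.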
Pages: 601-608
Number of pages: 8
Related papers
50 in total
  • [1] Skeleton-Based Data Augmentation for Sign Language Recognition Using Adversarial Learning
    Nakamura, Yuriya
    Jing, Lei
    IEEE ACCESS, 2025, 13 : 15290 - 15300
  • [2] An effective skeleton-based approach for multilingual sign language recognition
    Renjith, S.
    Suresh, M. S. Sumi
    Rashmi, Manazhy
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143
  • [3] Hand Graph Topology Selection for Skeleton-based Sign Language Recognition
    Ozdemir, Ogulcan
    Baytas, Inci M.
    Akarun, Lale
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [4] SKIM: Skeleton-Based Isolated Sign Language Recognition With Part Mixing
    Lin, Kezhou
    Wang, Xiaohan
    Zhu, Linchao
    Zhang, Bang
    Yang, Yi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4271 - 4280
  • [5] Asymmetric multi-branch GCN for skeleton-based sign language recognition
    Liu, Yuhong
    Lu, Fei
    Cheng, Xianpeng
    Yuan, Ying
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 75293 - 75319
  • [6] Multi-cue temporal modeling for skeleton-based sign language recognition
    Ozdemir, Ogulcan
    Baytas, Inci M.
    Akarun, Lale
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [7] SC2SLR: Skeleton-based Contrast for Sign Language Recognition
    Lyu, Silu
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 404 - 410
  • [8] Skeleton-Based Mutual Action Recognition Using Interactive Skeleton Graph and Joint Attention
    Jia, Xiangze
    Zhang, Ji
    Wang, Zhen
    Luo, Yonglong
    Chen, Fulong
    Yang, Gaoming
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT II, 2022, 13427 : 110 - 116
  • [9] SML: A Skeleton-based multi-feature learning method for sign language recognition
    Deng, Zhiwen
    Leng, Yuquan
    Hu, Jing
    Lin, Zengrong
    Li, Xuerui
    Gao, Qing
    KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [10] Insight on Attention Modules for Skeleton-Based Action Recognition
    Jiang, Quanyan
    Wu, Xiaojun
    Kittler, Josef
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 242 - 255