A shapelet-based framework for large-scale word-level sign language database auto-construction

被引:0
|
作者
Ma, Xiang [1 ]
Wang, Qiang [1 ]
Zheng, Tianyou [1 ]
Yuan, Lin [1 ]
机构
[1] Harbin Inst Technol, Dept Control Sci & Engn, Harbin 150001, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 01期
基金
中国国家自然科学基金;
关键词
Sign language; Shapelet; Self-learning; Big data computing; GLOBAL BURDEN; RECOGNITION; COMBINATION; DESCRIPTOR; ATTENTION; DISTANCE; NETWORK;
D O I
10.1007/s00521-022-08018-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sign language recognition is a challenging and often underestimated problem that includes the asynchronous integration of multimodal articulators. Learning powerful applied statistical models requires much training data. However, well-labelled sign language databases are a scarce resource due to the high cost of manual labelling and performing. On the other hand, there exist a lot of sign language-interpreted videos on the Internet. This work aims to propose a framework to automatically learn a large-scale sign language database from sign language-interpreted videos. We achieved this by exploring the correspondence between subtitles and motions by discovering shapelets which are the most discriminative subsequences within the data sequences. In this paper, two modified shapelet methods were used to identify the target signs for 1000 words from 89 (96 h, 8 naive signers) sign language-interpreted videos in terms of brute force search and parameter learning. Then, an augmented (3-5 times larger) large-scale word-level sign database was finally constructed using an adaptive sample augmentation strategy that collected all similar video clips of the target sign as valid samples. Experiments on a subset of 100 words revealed a considerable speedup and 14% improvement in recall rate. The evaluation of three state-of-the-art sign language classifiers demonstrates the good discrimination of the database, and the sample augmentation strategy can significantly increase the recognition accuracy of all classifiers by 10-33% by increasing the number, variety, and balance of the data.
引用
收藏
页码:253 / 274
页数:22
相关论文
共 19 条
  • [1] A shapelet-based framework for large-scale word-level sign language database auto-construction
    Xiang Ma
    Qiang Wang
    Tianyou Zheng
    Lin Yuan
    Neural Computing and Applications, 2023, 35 : 253 - 274
  • [2] Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison
    Li, Dongxu
    Opazo, Cristian Rodriguez
    Yu, Xin
    Li, Hongdong
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1448 - 1458
  • [3] Sign Pose-based Transformer for Word-level Sign Language Recognition
    Bohacek, Matyas
    Hruz, Marek
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 182 - 191
  • [4] 3D gesture segmentation for word-level Arabic sign language using large-scale RGB video sequences and autoencoder convolutional networks
    Boukdir, Abdelbasset
    Benaddy, Mohamed
    Ellahyani, Ayoub
    El Meslouhi, Othmane
    Kardouchi, Mustapha
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (08) : 2055 - 2062
  • [5] 3D gesture segmentation for word-level Arabic sign language using large-scale RGB video sequences and autoencoder convolutional networks
    Abdelbasset Boukdir
    Mohamed Benaddy
    Ayoub Ellahyani
    Othmane El Meslouhi
    Mustapha Kardouchi
    Signal, Image and Video Processing, 2022, 16 : 2055 - 2062
  • [6] Construction and Application of a Large-Scale Chinese Abstractness Lexicon Based on Word Similarity
    Xu, Huidan
    Yang, Lijiao
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 122 - 130
  • [7] A Word Distributed Representation Based Framework for Large-scale Short Text Classification
    Yao, Di
    Bi, Jingping
    Huang, Jianhui
    Zhu, Jin
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [8] A study of the construction of a large-scale water quality spatial database based on ArcSDE
    Xu Shuna
    Yang Lingbin
    Zhang Xia
    Wu Jin
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 279 - +
  • [9] Construction of Adverbial-Verb Collocation Database Based on Large-Scale Corpus
    Xing, Dan
    Xun, Endong
    Wang, Chengwen
    Rao, Gaoqi
    Ma, Luyao
    CHINESE LEXICAL SEMANTICS (CLSW 2019), 2020, 11831 : 585 - 595
  • [10] The Challenges of Large-Scale, Web-Based Language Datasets: Word Length and Predictability Revisited
    Meylan, Stephan C.
    Griffiths, Thomas L.
    COGNITIVE SCIENCE, 2021, 45 (06)