Dynamic Affine-Invariant Shape-Appearance Handshape Features and Classification in Sign Language Videos

Cited by: 0
Authors
Roussos, Anastasios [1]
Theodorakis, Stavros [2]
Pitsikalis, Vassilis [2]
Maragos, Petros [2]
Affiliations
[1] Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Athens, Greece
Keywords
affine-invariant shape-appearance model; landmarks-free shape representation; static and dynamic priors; feature extraction; handshape classification; RECOGNITION; TRACKING; MODEL; MOTION
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We propose a novel approach, the dynamic affine-invariant shape-appearance model (Aff-SAM), and employ it for handshape classification and sign recognition in sign language (SL) videos. Aff-SAM offers a compact and descriptive representation of hand configurations as well as regularized model fitting, which assists hand tracking and the extraction of handshape features. We construct shape-appearance (SA) images that represent the hand's shape and appearance without landmark points. We model the variation of these images by linear combinations of eigenimages followed by affine transformations, which accounts for 3D hand pose changes and improves the model's compactness. We also incorporate static and dynamic handshape priors, which offer robustness to occlusions, a frequent occurrence in signing. The approach includes an affine signer adaptation component at the visual level that does not require a new signer-specific model to be trained from scratch; instead, a short development data set is used to adapt the models to a new signer. Experiments on the Boston-University-400 continuous SL corpus demonstrate improvements in handshape classification compared to other feature extraction approaches. Supplementary sign recognition experiments are conducted on a multi-signer, 100-sign data set from the Greek Sign Language Lemmas corpus. These explore fusion with movement cues as well as signer adaptation of Aff-SAM to multiple signers, with promising results.
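As an illustration of the phrase "linear combinations of eigenimages followed by affine transformations", the following is a minimal sketch and not the authors' implementation. It assumes tiny images stored as nested lists, a nearest-neighbour warp, and a convention in which output pixel (x, y) samples the unwarped image at A·(x, y) + t. All function names are hypothetical. The regularized fitting and the static and dynamic priors mentioned in the abstract are not shown.

```python
from typing import List, Sequence

Image = List[List[float]]  # row-major grayscale image (hypothetical toy format)


def linear_combination(mean: Image,
                       eigenimages: Sequence[Image],
                       weights: Sequence[float]) -> Image:
    """Return mean + sum_i weights[i] * eigenimages[i], pixel by pixel."""
    rows, cols = len(mean), len(mean[0])
    out = [row[:] for row in mean]  # copy so the mean image is not modified
    for w, eig in zip(weights, eigenimages):
        for r in range(rows):
            for c in range(cols):
                out[r][c] += w * eig[r][c]
    return out


def affine_warp(img: Image,
                a: Sequence[Sequence[float]],
                t: Sequence[float],
                fill: float = 0.0) -> Image:
    """Nearest-neighbour warp: output (x, y) samples img at A*(x, y) + t.

    Samples that fall outside the image are set to `fill`.
    """
    rows, cols = len(img), len(img[0])
    out = [[fill] * cols for _ in range(rows)]
    for y in range(rows):
        for x in range(cols):
            sx = a[0][0] * x + a[0][1] * y + t[0]
            sy = a[1][0] * x + a[1][1] * y + t[1]
            ix, iy = round(sx), round(sy)
            if 0 <= ix < cols and 0 <= iy < rows:
                out[y][x] = img[iy][ix]
    return out


def synthesize(mean: Image,
               eigenimages: Sequence[Image],
               weights: Sequence[float],
               a: Sequence[Sequence[float]],
               t: Sequence[float]) -> Image:
    """Combine eigenimages linearly, then apply the affine warp."""
    return affine_warp(linear_combination(mean, eigenimages, weights), a, t)


# Toy example: one eigenimage with a single bright pixel at the centre.
MEAN = [[0.0] * 3 for _ in range(3)]
EIG = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]

unshifted = synthesize(MEAN, [EIG], [2.0], IDENTITY, (0.0, 0.0))
shifted = synthesize(MEAN, [EIG], [2.0], IDENTITY, (1.0, 0.0))
```

In this toy setup the eigenimage weight controls the appearance while the affine parameters only move the pattern within the frame. This separation is the sense in which a pose change can be absorbed by the transformation rather than by additional eigenimages.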
Pages: 1627-1663
Number of pages: 37
Related papers
8 items in total
  • [1] AFFINE-INVARIANT MODELING OF SHAPE-APPEARANCE IMAGES APPLIED ON SIGN LANGUAGE HANDSHAPE CLASSIFICATION
    Roussos, Anastasios
    Theodorakis, Stavros
    Pitsikalis, Vassilis
    Maragos, Petros
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1417 - 1420
  • [2] AFFINE-INVARIANT MODELING OF SHAPE-APPEARANCE IMAGES APPLIED ON SIGN LANGUAGE HANDSHAPE CLASSIFICATION
    Roussos, Anastasios
    Theodorakis, Stavros
    Pitsikalis, Vassilis
    Maragos, Petros
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1417 - 1420
  • [3] New features for affine-invariant shape classification
    Dionisio, CRP
    Kim, HY
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2135 - 2138
  • [4] Affine-invariant curve normalization for object shape representation, classification, and retrieval
    Yannis Avrithis
    Yiannis Xirouhakis
    Stefanos Kollias
    Machine Vision and Applications, 2001, 13 : 80 - 94
  • [5] Affine-invariant curve normalization for object shape representation, classification, and retrieval
    Avrithis, Y
    Xirouhakis, Y
    Kollias, S
    MACHINE VISION AND APPLICATIONS, 2001, 13 (02) : 80 - 94
  • [6] New area matrix-based affine-invariant shape features and similarity metrics
    Dionisio, Carlos R. R.
    Kim, Hae Yong
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1725 - +
  • [7] UNSUPERVISED CLASSIFICATION OF EXTREME FACIAL EVENTS USING ACTIVE APPEARANCE MODELS TRACKING FOR SIGN LANGUAGE VIDEOS
    Antonakos, Epameinondas
    Pitsikalis, Vassilis
    Rodomagoulakis, Isidoros
    Maragos, Petros
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1409 - 1412
  • [8] Automatic Annotation and Segmentation of Sign Language Videos: Base-level Features and Lexical Signs Classification
    Chaaban, Hussein
    Gouiffes, Michele
    Braffort, Annelies
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 484 - 491