Dynamic Affine-Invariant Shape-Appearance Handshape Features and Classification in Sign Language Videos

被引：0

作者：

Roussos, Anastasios ^{[1
]}

Theodorakis, Stavros ^{[2
]}

Pitsikalis, Vassilis ^{[2
]}

Maragos, Petros ^{[2
]}

机构：

[1] Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England

[2] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Athens, Greece

来源：

JOURNAL OF MACHINE LEARNING RESEARCH | 2013年 / 14卷

关键词：

affine-invariant shape-appearance model; landmarks-free shape representation; static and dynamic priors; feature extraction; handshape classification; RECOGNITION; TRACKING; MODEL; MOTION;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose the novel approach of dynamic affine-invariant shape-appearance model (Aff-SAM) and employ it for handshape classification and sign recognition in sign language (SL) videos. Aff-SAM offers a compact and descriptive representation of hand configurations as well as regularized model-fitting, assisting hand tracking and extracting handshape features. We construct SA images representing the hand's shape and appearance without landmark points. We model the variation of the images by linear combinations of eigenimages followed by affine transformations, accounting for 3D hand pose changes and improving model's compactness. We also incorporate static and dynamic handshape priors, offering robustness in occlusions, which occur often in signing. The approach includes an affine signer adaptation component at the visual level, without requiring training from scratch a new singer-specific model. We rather employ a short development data set to adapt the models for a new signer. Experiments on the Boston-University-400 continuous SL corpus demonstrate improvements on handshape classification when compared to other feature extraction approaches. Supplementary evaluations of sign recognition experiments, are conducted on a multi-signer, 100-sign data set, from the Greek sign language lemmas corpus. These explore the fusion with movement cues as well as signer adaptation of Aff-SAM to multiple signers providing promising results.

引用

页码：1627 / 1663

页数：37

共 8 条

[1] AFFINE-INVARIANT MODELING OF SHAPE-APPEARANCE IMAGES APPLIED ON SIGN LANGUAGE HANDSHAPE CLASSIFICATION
Roussos, Anastasios
Theodorakis, Stavros
Pitsikalis, Vassilis
Maragos, Petros
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1417 - 1420
[2] AFFINE-INVARIANT MODELING OF SHAPE-APPEARANCE IMAGES APPLIED ON SIGN LANGUAGE HANDSHAPE CLASSIFICATION
Roussos, Anastasios
Theodorakis, Stavros
Pitsikalis, Vassilis
Maragos, Petros
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1417 - 1420
[3] New features for affine-invariant shape classification
Dionisio, CRP
Kim, HY
ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2135 - 2138
[4] Affine-invariant curve normalization for object shape representation, classification, and retrieval
Yannis Avrithis
Yiannis Xirouhakis
Stefanos Kollias
Machine Vision and Applications, 2001, 13 : 80 - 94
[5] Affine-invariant curve normalization for object shape representation, classification, and retrieval
Avrithis, Y
Xirouhakis, Y
Kollias, S
MACHINE VISION AND APPLICATIONS, 2001, 13 (02) : 80 - 94
[6] New area matrix-based affine-invariant shape features and similarity metrics
Dionisio, Carlos R. R.
Kim, Hae Yong
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1725 - +
[7] UNSUPERVISED CLASSIFICATION OF EXTREME FACIAL EVENTS USING ACTIVE APPEARANCE MODELS TRACKING FOR SIGN LANGUAGE VIDEOS
Antonakos, Epameinondas
Pitsikalis, Vassilis
Rodomagoulakis, Isidoros
Maragos, Petros
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1409 - 1412
[8] Automatic Annotation and Segmentation of Sign Language Videos: Base-level Features and Lexical Signs Classification
Chaaban, Hussein
Gouiffes, Michele
Braffort, Annelies
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 484 - 491

← 1 →