MODEL-LEVEL DATA-DRIVEN SUB-UNITS FOR SIGNS IN VIDEOS OF CONTINUOUS SIGN LANGUAGE

被引:11
|
作者
Theodorakis, Stavros [1 ]
Pitsikalis, Vassilis [1 ]
Maragos, Petros [1 ]
机构
[1] Natl Tech Univ Athens, Sch ECE, GR-15773 Athens, Greece
关键词
sign language; subunit modeling; HMM;
D O I
10.1109/ICASSP.2010.5495875
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We investigate the issue of sign language automatic phonetic subunit modeling, that is completely data driven and without any prior phonetic information. A first step of visual processing leads to simple and effective region-based visual features. Prior to the sub-unit modeling we propose to employ a pronunciation clustering step with respect to each sign. Afterwards, for each sign and pronunciation group we find the time segmentation at the hidden Markov model (HMM) level. The models employed refer to movements as a sequence of dominant hand positions. The constructed segments are exploited explicitly at the model level via hierarchical clustering of HMMs and lead to the data-driven movement sub-unit construction. The constructed movement sub-units are evaluated in qualitative analysis experiments on data from the Boston University (BU)-400 American Sign Language corpus showing promising results.
引用
收藏
页码:2262 / 2265
页数:4
相关论文
共 50 条
  • [1] Data-Driven Sub-Units and Modeling Structure for Continuous Sign Language Recognition with Multiple-Cues
    Pitsikalis, Vassilis
    Theodorakis, Stavros
    Maragos, Petros
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : A196 - A203
  • [2] Sign Language Recognition using Sub-Units
    Cooper, Helen
    Ong, Eng-Jon
    Pugeault, Nicolas
    Bowden, Richard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 2205 - 2231
  • [3] Sign language recognition using sub-units
    Cooper, Helen
    Ong, Eng-Jon
    Pugeault, Nicolas
    Bowden, Richard
    Journal of Machine Learning Research, 2012, 13 : 2205 - 2231
  • [4] Sign Language Recognition using Linguistically Derived Sub-Units
    Cooper, Helen
    Bowden, Richard
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : A57 - A60
  • [5] A Data-Driven Representation for Sign Language Production
    Walsh, Harry
    Ravanshad, Abolfazl
    Rahmani, Mariam
    Bowden, Richard
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [6] Video retrieval in sign language videos: how to model and compare signs ?
    Lefebvre-Albaret, F.
    Dalle, P.
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3049 - 3054
  • [7] USING DATA-DRIVEN SUBWORD UNITS IN LANGUAGE MODEL OF HIGHLY INFLECTIVE SLOVENIAN LANGUAGE
    Maucec, Mirjam Sepesy
    Rotovnik, Tomaz
    Kacic, Zdravko
    Brest, Janez
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2009, 23 (02) : 287 - 312
  • [8] MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
    Ma, Jian
    Wang, Wenguan
    Yang, Yi
    Zheng, Feng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 7241 - 7254
  • [9] Data-driven development of Virtual Sign Language Communication Agents
    Brock, Heike
    Balayn, Agathe
    Nakadai, Kazuhiro
    2018 27TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN 2018), 2018, : 370 - 377
  • [10] A data-driven approach to the semantics of iconicity in American Sign Language and English
    Thompson, Bill
    Perlman, Marcus
    Lupyan, Gary
    Sevcikova Sehyr, Zed
    Emmorey, Karen
    LANGUAGE AND COGNITION, 2020, 12 (01) : 182 - 202