Cultural Self-Adaptive Multimodal Gesture Generation Based on Multiple Culture Gesture Dataset

Cited by: 1
Authors
Wu, Jingyu [1]
Chen, Shi [2]
Gan, Shuyu [1]
Li, Weijun [1]
Yang, Changyuan [1]
Sun, Lingyun [2]
Affiliations
[1] Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Singapore Innovat & AI Joint Res Lab, Hangzhou, Zhejiang, Peoples R China
Keywords
co-speech gesture generation; datasets; multimodal chatbots; evaluation metric; nonverbal behavior; SPEECH; LANGUAGE
DOI
10.1145/3581783.3611705
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Co-speech gesture generation is essential for multimodal chatbots and agents. Previous research has extensively studied the relationship between text, audio, and gesture. To enhance cross-cultural communication, chatbots also need culture-specific gestures so that they can learn cultural differences and incorporate cultural cues. However, culture-specific gesture generation faces two challenges: the lack of large-scale, high-quality gesture datasets that cover diverse cultural groups, and the lack of generalization across cultures. In this paper, we first introduce the Multiple Culture Gesture Dataset (MCGD), the largest freely available gesture dataset to date. It covers ten cultures, over 200 speakers, and 10,000 segmented sequences. We further propose a Cultural Self-adaptive Gesture Generation Network (CSGN) that takes multimodal relationships into account when generating gestures, using a cascade architecture and learnable dynamic weight. The CSGN adaptively generates gestures with different cultural characteristics without retraining a new network. It extracts cultural features either from the multimodal inputs or from a cultural style embedding space given a designated culture. We evaluate our method extensively on four large-scale benchmark datasets. Empirical results show that our method achieves gesture generation for multiple cultures and improves the comprehensiveness of multimodal inputs. It improves the state-of-the-art average FGD from 53.7 to 48.0 and the culture deception rate (CDR) from 33.63% to 39.87%.
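To put the two reported figures on a common footing, the short sketch below converts them into absolute and relative changes. It uses only the four numbers quoted in the abstract; the helper name `relative_change` is hypothetical, and the reading that a lower FGD and a higher CDR are both improvements is an assumption based on the abstract's wording, not on the paper's definitions.

```python
def relative_change(before: float, after: float) -> float:
    """Signed relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Figures quoted in the abstract (previous state of the art -> proposed method).
fgd_before, fgd_after = 53.7, 48.0      # average FGD, assumed lower-is-better
cdr_before, cdr_after = 33.63, 39.87    # CDR in percent, assumed higher-is-better

fgd_abs = fgd_after - fgd_before
fgd_rel = relative_change(fgd_before, fgd_after)
cdr_abs = cdr_after - cdr_before        # difference in percentage points
cdr_rel = relative_change(cdr_before, cdr_after)

print(f"FGD: {fgd_abs:+.1f} absolute, {fgd_rel:+.1f}% relative")
print(f"CDR: {cdr_abs:+.2f} percentage points, {cdr_rel:+.1f}% relative")
```

Under these assumptions, the average FGD drops by 5.7 (about 10.6% relative), and the CDR rises by 6.24 percentage points (about 18.6% relative).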
Pages: 3538-3549
Number of pages: 12