Generative AI in Multimodal Cross-Lingual Dialogue System for Inclusive Communication Support

被引:0
|
作者
Nataraj, Vidhya [1 ]
Liao, Wen-Hsuan [2 ]
Chang, Yue-Shan [3 ]
Chiang, Chen-Yu [4 ]
Lin, Chao-Yin [5 ]
Lin, Yu-An [5 ]
Day, Min-Yuh [2 ]
机构
[1] Natl Taipei Univ, Smart Healthcare Management, New Taipei, Taiwan
[2] Natl Taipei Univ, Grad Inst Informat Management, New Taipei, Taiwan
[3] Natl Taipei Univ, Dept Comp Sci & Informat Engn, New Taipei, Taiwan
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
[5] Natl Taipei Univ, Dept Social Work, New Taipei, Taiwan
关键词
Generative AI; Large Language Models (LLMs); Multimodal; Cross-lingual; Dialogue System; Inclusive Communication Support;
D O I
10.1109/IRI62200.2024.00051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advancements in natural language processing have enhanced dialogue systems, making them vital for inclusive technology that facilitates accessible interactions across diverse user needs. However, existing systems often struggle with multimodal inputs, multilingual support, and generating contextually appropriate responses in data-scarce environments. This research addresses these gaps by developing an integrated dialogue system leveraging generative AI models like ChatGPT and multimodal inputs like text, audio, and image. The system utilizes transfer learning and large language models (LLMs) to process multilingual data, generating comprehensive responses tailored to user context. The proposed approach constructs a multimodal cross-lingual task-oriented dialogue system capable of understanding and responding to users in multiple languages and modalities. The proposed multimodal cross-lingual task-oriented dialogue system will enhance functionality and inclusivity compared to traditional unimodal or single-language dialogue systems in providing inclusive communication support. The major research contribution of this study highlights the potential of generative AI in developing accessible dialogue systems that cater to diverse user needs to advance inclusive technology. Practitioner implications of this paper highlight the potential of multimodal cross-lingual dialogue system to foster digital inclusion and inclusive communication support, improving accessibility and equity in human-computer interactions for diverse users.
引用
收藏
页码:204 / 209
页数:6
相关论文
共 50 条
  • [1] The NESPOLE! multimodal interface for cross-lingual communication experience and lessons learned
    Taddei, L
    Costantini, E
    Lavie, A
    FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 223 - 228
  • [2] Robust Cross-lingual Task-oriented Dialogue
    Xiang, Lu
    Zhu, Junnan
    Zhao, Yang
    Zhou, Yu
    Zong, Chengqing
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
  • [3] Cross-lingual Cross-modal Pretraining for Multimodal Retrieval
    Fei, Hongliang
    Yu, Tan
    Li, Ping
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3644 - 3650
  • [4] Cross-Lingual Transfer Learning for Affective Spoken Dialogue Systems
    Gjoreski, Kristijan
    Gjoreski, Aleksandar
    Kraljevski, Ivan
    Hirschfeld, Diane
    INTERSPEECH 2019, 2019, : 1916 - 1920
  • [5] Robust Cross-lingual Dialogue System Based on Multi-granularity Adversarial Training
    Xiang L.
    Zhu J.-N.
    Zhou Y.
    Zong C.-Q.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (08): : 1855 - 1866
  • [6] ON THE STUDY OF GENERATIVE ADVERSARIAL NETWORKS FOR CROSS-LINGUAL VOICE CONVERSION
    Sisman, Berrak
    Zhang, Mingyang
    Dong, Minghui
    Li, Haizhou
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 144 - 151
  • [7] An Interactive Framework of Cross-Lingual NLU for In-Vehicle Dialogue
    Li, Xinlu
    Fang, Liangkuan
    Zhang, Lexuan
    Cao, Pei
    SENSORS, 2023, 23 (20)
  • [8] Understanding Cross-lingual Pragmatic Misunderstandings in Email Communication
    Lim H.
    Cosley D.
    Fussell S.R.
    Proceedings of the ACM on Human-Computer Interaction, 2022, 6 (CSCW1)
  • [9] Cross-lingual Text Clustering in a Large System
    Schneider, Nicole R.
    Sankaranarayanan, Jagan
    Samet, Hanan
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 1 - 11
  • [10] A system for supporting cross-lingual information retrieval
    Capstick, J
    Diagne, AK
    Erbach, G
    Uszkoreit, H
    Leisenberg, A
    Leisenberg, M
    INFORMATION PROCESSING & MANAGEMENT, 2000, 36 (02) : 275 - 289