Generative AI in Multimodal Cross-Lingual Dialogue System for Inclusive Communication Support

Cited by: 0
Authors
Nataraj, Vidhya [1]
Liao, Wen-Hsuan [2]
Chang, Yue-Shan [3]
Chiang, Chen-Yu [4]
Lin, Chao-Yin [5]
Lin, Yu-An [5]
Day, Min-Yuh [2]
Affiliations
[1] Natl Taipei Univ, Smart Healthcare Management, New Taipei, Taiwan
[2] Natl Taipei Univ, Grad Inst Informat Management, New Taipei, Taiwan
[3] Natl Taipei Univ, Dept Comp Sci & Informat Engn, New Taipei, Taiwan
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
[5] Natl Taipei Univ, Dept Social Work, New Taipei, Taiwan
Keywords
Generative AI; Large Language Models (LLMs); Multimodal; Cross-lingual; Dialogue System; Inclusive Communication Support
DOI
10.1109/IRI62200.2024.00051
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Advancements in natural language processing have enhanced dialogue systems, making them vital for inclusive technology that facilitates accessible interactions across diverse user needs. However, existing systems often struggle with multimodal inputs, multilingual support, and generating contextually appropriate responses in data-scarce environments. This research addresses these gaps by developing an integrated dialogue system that leverages generative AI models such as ChatGPT and multimodal inputs such as text, audio, and images. The system uses transfer learning and large language models (LLMs) to process multilingual data and generate comprehensive responses tailored to user context. The proposed approach constructs a multimodal cross-lingual task-oriented dialogue system capable of understanding and responding to users in multiple languages and modalities. Compared with traditional unimodal or single-language dialogue systems, the proposed system is expected to enhance functionality and inclusivity in providing inclusive communication support. The major research contribution of this study is to highlight the potential of generative AI for developing accessible dialogue systems that cater to diverse user needs and advance inclusive technology. For practitioners, the paper highlights the potential of multimodal cross-lingual dialogue systems to foster digital inclusion and inclusive communication support, improving accessibility and equity in human-computer interactions for diverse users.
Pages: 204-209
Number of pages: 6