Generative AI in Multimodal Cross-Lingual Dialogue System for Inclusive Communication Support

被引:0
|
作者
Nataraj, Vidhya [1 ]
Liao, Wen-Hsuan [2 ]
Chang, Yue-Shan [3 ]
Chiang, Chen-Yu [4 ]
Lin, Chao-Yin [5 ]
Lin, Yu-An [5 ]
Day, Min-Yuh [2 ]
机构
[1] Natl Taipei Univ, Smart Healthcare Management, New Taipei, Taiwan
[2] Natl Taipei Univ, Grad Inst Informat Management, New Taipei, Taiwan
[3] Natl Taipei Univ, Dept Comp Sci & Informat Engn, New Taipei, Taiwan
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
[5] Natl Taipei Univ, Dept Social Work, New Taipei, Taiwan
关键词
Generative AI; Large Language Models (LLMs); Multimodal; Cross-lingual; Dialogue System; Inclusive Communication Support;
D O I
10.1109/IRI62200.2024.00051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advancements in natural language processing have enhanced dialogue systems, making them vital for inclusive technology that facilitates accessible interactions across diverse user needs. However, existing systems often struggle with multimodal inputs, multilingual support, and generating contextually appropriate responses in data-scarce environments. This research addresses these gaps by developing an integrated dialogue system leveraging generative AI models like ChatGPT and multimodal inputs like text, audio, and image. The system utilizes transfer learning and large language models (LLMs) to process multilingual data, generating comprehensive responses tailored to user context. The proposed approach constructs a multimodal cross-lingual task-oriented dialogue system capable of understanding and responding to users in multiple languages and modalities. The proposed multimodal cross-lingual task-oriented dialogue system will enhance functionality and inclusivity compared to traditional unimodal or single-language dialogue systems in providing inclusive communication support. The major research contribution of this study highlights the potential of generative AI in developing accessible dialogue systems that cater to diverse user needs to advance inclusive technology. Practitioner implications of this paper highlight the potential of multimodal cross-lingual dialogue system to foster digital inclusion and inclusive communication support, improving accessibility and equity in human-computer interactions for diverse users.
引用
收藏
页码:204 / 209
页数:6
相关论文
共 50 条
  • [21] Towards Making the Most of Knowledge Across Languages for Multimodal Cross-Lingual Summarization
    Shi, Xiaorui
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 424 - 438
  • [22] An integrated information retrieval support system for multiple distributed heterogeneous cross-lingual information sources
    Qiao, L
    Huang, WT
    Wen, Q
    Fu, XL
    NETWORKING AND MOBILE COMPUTING, PROCEEDINGS, 2005, 3619 : 863 - 872
  • [23] picoTrans: An Intelligent Icon-Driven Interface for Cross-Lingual Communication
    Song, Wei
    Finch, Andrew
    Tanaka-Ishii, Kumiko
    Yasuda, Keiji
    Sumita, Eiichiro
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2013, 3 (01)
  • [24] A Semantic Content Based Recommendation System for Cross-Lingual News
    Ferdous, Syeda Nyma
    Ali, Muhammad Masroor
    2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2017,
  • [25] Enhancing Cross-Lingual Image Description: A Multimodal Approach for Semantic Relevance and Stylistic Alignment
    Al-Buraihy, Emran
    Wang, Dan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 3913 - 3938
  • [26] Zero-Shot Cross-Lingual Knowledge Transfer in VQA via Multimodal Distillation
    Weng, Yu
    Dong, Jun
    He, Wenbin
    Chaomurilige
    Liu, Xuan
    Liu, Zheng
    Gao, Honghao
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, : 1 - 11
  • [27] Spoken language translation - Enabling cross-lingual human-human communication
    Waibel, Alex
    Fuegen, Christian
    IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (03) : 70 - 79
  • [28] Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction
    Huang, Kuan-Hao
    Hsu, I-Hung
    Natarajan, Premkumar
    Chang, Kai-Wei
    Peng, Nanyun
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4633 - 4646
  • [29] Improving Zero-Shot Cross-Lingual Dialogue State Tracking via Contrastive Learning
    Xiang, Yu
    Zhang, Ting
    Di, Hui
    Huang, Hui
    Li, Chunyou
    Ouchi, Kazushige
    Chen, Yufeng
    Xu, Jinan
    CHINESE COMPUTATIONAL LINGUISTICS, CCL 2023, 2023, 14232 : 127 - 141
  • [30] Enhancing a Rule-Based MT System with Cross-Lingual WSD
    Rudnick, Alex
    Rios, Annette
    Gasser, Michael
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,