Generative AI in Multimodal Cross-Lingual Dialogue System for Inclusive Communication Support

被引:0
|
作者
Nataraj, Vidhya [1 ]
Liao, Wen-Hsuan [2 ]
Chang, Yue-Shan [3 ]
Chiang, Chen-Yu [4 ]
Lin, Chao-Yin [5 ]
Lin, Yu-An [5 ]
Day, Min-Yuh [2 ]
机构
[1] Natl Taipei Univ, Smart Healthcare Management, New Taipei, Taiwan
[2] Natl Taipei Univ, Grad Inst Informat Management, New Taipei, Taiwan
[3] Natl Taipei Univ, Dept Comp Sci & Informat Engn, New Taipei, Taiwan
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
[5] Natl Taipei Univ, Dept Social Work, New Taipei, Taiwan
关键词
Generative AI; Large Language Models (LLMs); Multimodal; Cross-lingual; Dialogue System; Inclusive Communication Support;
D O I
10.1109/IRI62200.2024.00051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advancements in natural language processing have enhanced dialogue systems, making them vital for inclusive technology that facilitates accessible interactions across diverse user needs. However, existing systems often struggle with multimodal inputs, multilingual support, and generating contextually appropriate responses in data-scarce environments. This research addresses these gaps by developing an integrated dialogue system leveraging generative AI models like ChatGPT and multimodal inputs like text, audio, and image. The system utilizes transfer learning and large language models (LLMs) to process multilingual data, generating comprehensive responses tailored to user context. The proposed approach constructs a multimodal cross-lingual task-oriented dialogue system capable of understanding and responding to users in multiple languages and modalities. The proposed multimodal cross-lingual task-oriented dialogue system will enhance functionality and inclusivity compared to traditional unimodal or single-language dialogue systems in providing inclusive communication support. The major research contribution of this study highlights the potential of generative AI in developing accessible dialogue systems that cater to diverse user needs to advance inclusive technology. Practitioner implications of this paper highlight the potential of multimodal cross-lingual dialogue system to foster digital inclusion and inclusive communication support, improving accessibility and equity in human-computer interactions for diverse users.
引用
收藏
页码:204 / 209
页数:6
相关论文
共 50 条
  • [31] A fuzzy knowledge-based system for cross-lingual text retrieval
    Chau, R
    Yeh, CH
    COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION - EVOLUTIONARY COMPUTATION & FUZZY LOGIC FOR INTELLIGENT CONTROL, KNOWLEDGE ACQUISITION & INFORMATION RETRIEVAL, 1999, 55 : 488 - 494
  • [32] Study on Cross-Lingual Adaptation of a Czech LVCSR System towards Slovak
    Cerva, Petr
    Nouza, Jan
    Silovsky, Jan
    ANALYSIS OF VERBAL AND NONVERBAL COMMUNICATION AND ENACTMENT: THE PROCESSING ISSUES, 2011, 6800 : 81 - 87
  • [33] Cost-efficient cross-lingual adaptation of a speech recognition system
    Callejas, Zoraida
    Nouza, Jan
    Cerva, Petr
    López-Cózar, Ramón
    Advances in Intelligent and Soft Computing, 2009, 57 : 331 - 338
  • [34] Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
    Liu, Lei
    Huang, Jimmy Xiangji
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2287 - 2292
  • [35] AfriWOZ: Corpus for Exploiting Cross-Lingual Transfer for Dialogue Generation in Low-Resource, African Languages
    Adewumi, Tosin
    Adeyemi, Mofetoluwa
    Anuoluwapo, Aremu
    Peters, Bukola
    Buzaaba, Happy
    Samuel, Oyerinde
    Rufai, Amina Mardiyyah
    Ajibade, Benjamin
    Gwadabe, Tajudeen
    Traore, Mory Moussou Koulibaly
    Ajayi, Tunde Oluwaseyi
    Muhammad, Shamsuddeen
    Baruwa, Ahmed
    Owoicho, Paul
    Ogunremi, Tolulope
    Ngigi, Phylis
    Ahia, Orevaoghene
    Nasir, Ruqayya
    Liwicki, Foteini
    Liwicki, Marcus
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [36] A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems
    Kim, San
    Jang, Jin Yea
    Jung, Minyoung
    Shin, Saim
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 352 - 365
  • [37] A Cross-Lingual Mobile Medical Communication System Prototype for Foreigners and Subjects with Speech, Hearing, and Mental Disabilities Based on Pictograms
    Wolk, Krzysztof
    Wolk, Agnieszka
    Glinkowski, Wojciech
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2017, 2017
  • [38] Ontology-based Tamil–English cross-lingual information retrieval system
    D Thenmozhi
    Chandrabose Aravindan
    Sādhanā, 2018, 43
  • [39] KANSHIN: A Cross-lingual Concern Analysis System using Multilingual Blog Articles
    Fukuhara, Tomohiro
    Kimura, Akifumi
    Arai, Yoshiaki
    Yoshinaka, Takayuki
    Masuda, Hidetaka
    Utsuro, Takehito
    Nakagawa, Hiroshi
    2008 INTERNATIONAL WORKSHOP ON INFORMATION-EXPLOSION AND NEXT GENERATION SEARCH : INGS 2008, PROCEEDINGS, 2008, : 83 - +
  • [40] Cross-Lingual Voice Conversion With Controllable Speaker Individuality Using Variational Autoencoder and Star Generative Adversarial Network
    Ho, Tuan Vu
    Akagi, Masato
    IEEE ACCESS, 2021, 9 : 47503 - 47515