Task-Oriented Multi-User Semantic Communications

被引:146
|
作者
Xie, Huiqiang [1 ]
Qin, Zhijin [1 ]
Tao, Xiaoming [2 ]
Letaief, Khaled B. [3 ,4 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Transmitters; Transformers; Receivers; Image retrieval; Machine translation; Deep learning; semantic communications; multimodal fusion; multi-user communications; transformer; WIRELESS COMMUNICATIONS; INTERNET;
D O I
10.1109/JSAC.2022.3191326
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
While semantic communications have shown the potential in the case of single-modal single-users, its applications to the multi-user scenario remain limited. In this paper, we investigate deep learning (DL) based multi-user semantic communication systems for transmitting single-modal data and multimodal data, respectively. We adopt three intelligent tasks, including, image retrieval, machine translation, and visual question answering (VQA) as the transmission goal of semantic communication systems. We propose a Transformer based framework to unify the structure of transmitters for different tasks. For the single-modal multi-user system, we propose two Transformer based models, named, DeepSC-IR and DeepSC-MT, to perform image retrieval and machine translation, respectively. In this case, DeepSC-IR is trained to optimize the distance in embedding space between images and DeepSC-MT is trained to minimize the semantic errors by recovering the semantic meaning of sentences. For the multimodal multi-user system, we develop a Transformer enabled model, named, DeepSC-VQA, for the VQA task by extracting text-image information at the transmitters and fusing it at the receiver. In particular, a novel layer-wise Transformer is designed to help fuse multimodal data by adding connection between each of the encoder and decoder layers. Numerical results show that the proposed models are superior to traditional communications in terms of the robustness to channels, computational complexity, transmission delay, and the task-execution performance at various task-specific metrics.
引用
收藏
页码:2584 / 2597
页数:14
相关论文
共 50 条
  • [1] Task-Oriented Multi-User Semantic Communications for VQA
    Xie, Huiqiang
    Qin, Zhijin
    Li, Geoffrey Ye
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (03) : 553 - 557
  • [2] Task-Oriented Explainable Semantic Communications
    Ma, Shuai
    Qiao, Weining
    Wu, Youlong
    Li, Hang
    Shi, Guangming
    Gao, Dahua
    Shi, Yuanming
    Li, Shiyin
    Al-Dhahir, Naofal
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9248 - 9262
  • [3] Multi-User MultiWOZ: Task-Oriented Dialogues among Multiple Users
    Jo, Yohan
    Zhao, Xinyan
    Biswas, Arijit
    Basiou, Nikoletta
    Auvray, Vincent
    Malandrakis, Nikolaos
    Metallinou, Angeliki
    Potamianos, Alexandros
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 3237 - 3269
  • [4] Task-Oriented Semantic Communications for Speech Transmission
    Weng, Zhenzi
    Qin, Zhijin
    Tao, Xiaoming
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [5] Task-Oriented Multi-User Semantic Communication With Lightweight Semantic Encoder and Fast Training for Resource-Constrained Terminal Devices
    Peng, Jincheng
    Xing, Huanlai
    Li, Yang
    Feng, Li
    Xu, Lexi
    Lei, Xianfu
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (09) : 2427 - 2431
  • [6] Adversarial Reinforcement Learning Based Data Poisoning Attacks Defense for Task-Oriented Multi-User Semantic Communication
    Peng, Jincheng
    Xing, Huanlai
    Xu, Lexi
    Luo, Shouxi
    Dai, Penglin
    Feng, Li
    Song, Jing
    Zhao, Bowen
    Xiao, Zhiwen
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 14834 - 14851
  • [7] Stacked Intelligent Metasurfaces for Task-Oriented Semantic Communications
    Huang, Guojun
    An, Jiancheng
    Yang, Zhaohui
    Gan, Lu
    Bennis, Mehdi
    Debbah, Merouane
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2025, 14 (02) : 310 - 314
  • [8] Intelligent task-oriented semantic communications: theory, technology and challenges
    Liu, Chuanhong
    Guo, Caili
    Yang, Yang
    Chen, Jiujiu
    Zhu, Meiyi
    Sun, Lu'nan
    Tongxin Xuebao/Journal on Communications, 2022, 43 (06): : 41 - 57
  • [9] Adaptable Semantic Compression and Resource Allocation for Task-Oriented Communications
    Liu, Chuanhong
    Guo, Caili
    Yang, Yang
    Jiang, Nan
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (03) : 769 - 782
  • [10] Multi-User Semantic Communications for Cooperative Object Identification
    Zhang, Yimeng
    Xu, Wenjun
    Gao, Hui
    Wang, Fengyu
    2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 157 - 162