Task-Oriented Video Compressive Streaming for Real-Time Semantic Segmentation

被引:1
|
作者
Xiao, Xuedou [1 ]
Zuo, Yingying [2 ]
Yan, Mingxuan [2 ]
Wang, Wei [2 ]
He, Jianhua [3 ]
Zhang, Qian [4 ]
机构
[1] Wuhan Univ Technol, Sch Nav, Wuhan 430062, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[3] Essex Univ, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, England
[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Clear Water Bay, Hong Kong, Peoples R China
基金
欧盟地平线“2020”; 中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Image coding; Bandwidth; Streaming media; Semantic segmentation; Accuracy; Servers; Predictive coding; Adaptive streaming; DNN-driven compression; edge computing; semantic segmentation;
D O I
10.1109/TMC.2024.3446185
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time semantic segmentation (SS) is a major task for various vision-based applications such as self-driving. Due to the limited computing resources and stringent performance requirements, streaming videos from camera-embedded mobile devices to edge servers for SS is a promising approach. While there are increasing efforts on task-oriented video compression, most SS-applicable algorithms apply more uniform compression, as the sensitive regions are less obvious and concentrated. Such processing results in low compression performance and significantly limits the capacity of edge servers supporting real-time SS. In this paper, we propose STAC, a novel task-oriented DNN-driven video compressive streaming algorithm tailed for SS, to strike accuracy-bitrate balance and adapt to time-varying bandwidth. It exploits DNN's gradients as sensitivity metrics for fine-grained spatial adaptive compression and includes a temporal adaptive scheme that integrates spatial adaptation with predictive coding. Furthermore, we design a new bandwidth-aware neural network, serving as a compatible configuration tuner to fit time-varying bandwidth and content. STAC is evaluated in a system with a commodity mobile device and an edge server with real-world network traces. Experiments show that STAC can save up to 63.7-75.2% of bandwidth or improve accuracy by 3.1-9.5% compared to state-of-the-art algorithms, while capable of adapting to time-varying bandwidth.
引用
收藏
页码:14396 / 14413
页数:18
相关论文
共 50 条
  • [21] Task-Oriented Semantic Communications for Speech Transmission
    Weng, Zhenzi
    Qin, Zhijin
    Tao, Xiaoming
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [22] Task-Oriented Semantic Communication with Foundation Models
    Chen Mingkai
    Liu Minghao
    Zhang Zhe
    Xu Zhiping
    Wang Lei
    China Communications, 2024, 21 (07) : 65 - 77
  • [23] Semantic Equivalence of Task-Oriented Programs in TopHat
    Klijnsma, Tosca
    Steenvoorden, Tim
    TRENDS IN FUNCTIONAL PROGRAMMING, TFP 2022, 2022, 13401 : 100 - 125
  • [24] Task-Oriented Semantic Communication with Foundation Models
    Chen, Mingkai
    Liu, Minghao
    Zhe, Zhang
    Xu, Zhiping
    Lei, Wang
    CHINA COMMUNICATIONS, 2024, 21 (07) : 65 - 77
  • [25] Diagnosing Transformers in Task-Oriented Semantic Parsing
    Desai, Shrey
    Aly, Ahmed
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 57 - 62
  • [26] A safety-related PES for task-oriented real-time execution without asynchronous interrupts
    Skambraks, M
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, PROCEEDINGS, 2005, 3688 : 261 - 274
  • [27] Task-Oriented Communication for Edge Video Analytics
    Shao, Jiawei
    Zhang, Xinjie
    Zhang, Jun
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (05) : 4141 - 4154
  • [28] Real-Time Semantic Segmentation with Label Propagation
    Sheikh, Rasha
    Garbade, Martin
    Gall, Juergen
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 3 - 14
  • [29] TASK-ORIENTED CAMERA ASSIGNMENT IN A VIDEO NETWORK
    Li, Yiming
    Bhanu, Bir
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3473 - 3476
  • [30] Real-time Semantic Segmentation for Road Scene
    Zhang, Xuetao
    Chen, Zhenxue
    Lu, Dan
    Li, Xianming
    2018 3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (IEEE ICARM), 2018, : 19 - 23