Task-Oriented Video Compressive Streaming for Real-Time Semantic Segmentation

被引:1
|
作者
Xiao, Xuedou [1 ]
Zuo, Yingying [2 ]
Yan, Mingxuan [2 ]
Wang, Wei [2 ]
He, Jianhua [3 ]
Zhang, Qian [4 ]
机构
[1] Wuhan Univ Technol, Sch Nav, Wuhan 430062, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[3] Essex Univ, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, England
[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Clear Water Bay, Hong Kong, Peoples R China
基金
欧盟地平线“2020”; 中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Image coding; Bandwidth; Streaming media; Semantic segmentation; Accuracy; Servers; Predictive coding; Adaptive streaming; DNN-driven compression; edge computing; semantic segmentation;
D O I
10.1109/TMC.2024.3446185
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time semantic segmentation (SS) is a major task for various vision-based applications such as self-driving. Due to the limited computing resources and stringent performance requirements, streaming videos from camera-embedded mobile devices to edge servers for SS is a promising approach. While there are increasing efforts on task-oriented video compression, most SS-applicable algorithms apply more uniform compression, as the sensitive regions are less obvious and concentrated. Such processing results in low compression performance and significantly limits the capacity of edge servers supporting real-time SS. In this paper, we propose STAC, a novel task-oriented DNN-driven video compressive streaming algorithm tailed for SS, to strike accuracy-bitrate balance and adapt to time-varying bandwidth. It exploits DNN's gradients as sensitivity metrics for fine-grained spatial adaptive compression and includes a temporal adaptive scheme that integrates spatial adaptation with predictive coding. Furthermore, we design a new bandwidth-aware neural network, serving as a compatible configuration tuner to fit time-varying bandwidth and content. STAC is evaluated in a system with a commodity mobile device and an edge server with real-world network traces. Experiments show that STAC can save up to 63.7-75.2% of bandwidth or improve accuracy by 3.1-9.5% compared to state-of-the-art algorithms, while capable of adapting to time-varying bandwidth.
引用
收藏
页码:14396 / 14413
页数:18
相关论文
共 50 条
  • [31] Efficient ConvNet for Real-time Semantic Segmentation
    Romera, Eduardo
    Alvarez, Jose M.
    Bergasa, Luis M.
    Arroyo, Roberto
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1789 - 1794
  • [32] Real-Time Semantic Segmentation With Fast Attention
    Hu, Ping
    Perazzi, Federico
    Heilbron, Fabian Caba
    Wang, Oliver
    Lin, Zhe
    Saenko, Kate
    Sclaroff, Stan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (01) : 263 - 270
  • [33] Rethinking BiSeNet For Real-time Semantic Segmentation
    Fan, Mingyuan
    Lai, Shenqi
    Huang, Junshi
    Wei, Xiaoming
    Chai, Zhenhua
    Luo, Junfeng
    Wei, Xiaolin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9711 - 9720
  • [34] Background Subtraction With Real-Time Semantic Segmentation
    Zeng, Dongdong
    Chen, Xiang
    Zhu, Ming
    Goesele, Michael
    Kuijper, Arjan
    IEEE ACCESS, 2019, 7 : 153869 - 153884
  • [35] Real-Time Driving Scene Semantic Segmentation
    Wang, Wenfu
    Fu, Yongjian
    Pan, Zhijie
    Li, Xi
    Zhuang, Yueting
    IEEE ACCESS, 2020, 8 : 36776 - 36788
  • [36] Hierarchical Semantic Broadcasting Network for Real-Time Semantic Segmentation
    Li, Genling
    Li, Liang
    Zhang, Jiawan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 309 - 313
  • [37] BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation
    Yu, Changqian
    Wang, Jingbo
    Peng, Chao
    Gao, Changxin
    Yu, Gang
    Sang, Nong
    COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 334 - 349
  • [38] Task-Oriented Real-Time Optimization Method of Dynamic Force Distribution for Multi-Fingered Grasping
    Liu, Ziqi
    Jiang, Li
    Yang, Bin
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2022, 19 (05)
  • [39] Real-time road scene segmentation based on knowledge distillation Real-time road semantic segmentation
    Li, Wenting
    Yang, Huicheng
    Hu, Yaocong
    Lin, Yuanyuan
    Shuai, Zhen
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 429 - 433
  • [40] A real-time video watermarking algorithm for streaming media
    College of Computer, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003, China
    不详
    不详
    不详
    Cheng, C. (chengcl@njupt.edu.cn), 1600, Advanced Institute of Convergence Information Technology (06):