Semantic-Aware Visual Decomposition for Image Coding

Cited by: 3
Authors
Chang, Jianhui [1]
Zhang, Jian [2]
Li, Jiguo [3]
Wang, Shiqi [4]
Mao, Qi [5]
Jia, Chuanmin [1]
Ma, Siwei [1]
Gao, Wen [1]
Affiliations
[1] Peking Univ, Natl Engn Res Ctr Visual Technol, Sch Comp Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image coding; Semantic-aware visual decomposition; Structure-texture; Coherency regularization; Extremely low bitrate;
DOI
10.1007/s11263-023-01809-7
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a novel image coding framework with semantic-aware visual decomposition towards extremely low bitrate compression. In particular, at the encoder side an input image is decomposed into a semantic map as the structural representation and a semantic-wise texture representation, both of which are further compressed into bitstreams. On the decoder side, the received bitstreams of the dual-layer representations are decoded and used to synthesize the target image with generative models. Moreover, an attention mechanism is introduced into the model architecture for texture representation modeling, and a coherency regularization is proposed to further optimize the texture representation space by aligning it with the source pixel space for higher synthesis quality. We also propose a cross-channel entropy module and control the quantization scale to facilitate rate-distortion optimization. Upon compressing the decomposed components into the bitstream, the simple yet effective representation philosophy benefits image compression in many aspects. First, in terms of compression performance, compact representations and high visual synthesis quality bring remarkable advantages. Second, the proposed framework yields a physically explainable bitstream composed of the structural segment and semantic-wise texture segments. Third and most importantly, subsequent vision tasks (e.g., content manipulation) can receive fundamental support from the semantic-aware visual decomposition and synthesis mechanism. Extensive experimental results demonstrate the superiority of the proposed framework towards efficient visual representation learning, high-efficiency image compression (< 0.1 bpp), and intelligent visual applications (e.g., manipulation and analysis).
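As a rough illustration of what the "< 0.1 bpp" operating point means for a dual-layer bitstream, the following minimal sketch computes bits per pixel from the sizes of a structure segment and a texture segment. The function names, the image size, and the byte counts are hypothetical values chosen for illustration; none of them are taken from the paper.

```python
def bits_per_pixel(structure_bytes, texture_bytes, width, height):
    """Bitrate in bits per pixel for a two-segment bitstream.

    structure_bytes: size of the coded semantic map (hypothetical value).
    texture_bytes: size of the coded semantic-wise textures (hypothetical value).
    """
    total_bits = 8 * (structure_bytes + texture_bytes)
    return total_bits / (width * height)


def max_bytes_for_bpp(target_bpp, width, height):
    """Largest total bitstream size, in bytes, that stays at target_bpp."""
    return target_bpp * width * height / 8


# Assumed example: a 512x512 image with a 1000-byte structure segment
# and a 2000-byte texture segment.
rate = bits_per_pixel(1000, 2000, 512, 512)
budget = max_bytes_for_bpp(0.1, 512, 512)
print(f"rate = {rate:.4f} bpp, 0.1 bpp budget = {budget:.1f} bytes")
```

Under these assumed numbers, the whole 512x512 image must fit in roughly 3.2 KB to stay below 0.1 bpp, which is why a compact semantic map plus low-dimensional per-class texture codes is attractive at this rate.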
Pages: 2333-2355
Page count: 23
Related papers
50 in total
  • [1] Semantic-Aware Visual Decomposition for Image Coding
    Jianhui Chang
    Jian Zhang
    Jiguo Li
    Shiqi Wang
    Qi Mao
    Chuanmin Jia
    Siwei Ma
    Wen Gao
    International Journal of Computer Vision, 2023, 131 : 2333 - 2355
  • [2] Semantic-Aware Visual Decomposition for Point Cloud Geometry Compression
    Xie, Liang
    Gao, Wei
    Zheng, Huiming
    Ye, Hua
    2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 595 - 595
  • [3] Semantic-aware visual consistency network for fused image harmonisation
    Yu, Huayan
    Huang, Hai
    Zhu, Yueyan
    Chen, Aoran
    IET SIGNAL PROCESSING, 2023, 17 (06)
  • [4] Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning
    Song, Kaiyou
    Zhang, Shan
    Wang, Tong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4925 - 4933
  • [5] Semantic-aware visual scene representation
    Mohammad Javad Parseh
    Mohammad Rahmanimanesh
    Parviz Keshavarzi
    Zohreh Azimifar
    International Journal of Multimedia Information Retrieval, 2022, 11 : 619 - 638
  • [6] Semantic-aware visual scene representation
    Parseh, Mohammad Javad
    Rahmanimanesh, Mohammad
    Keshavarzi, Parviz
    Azimifar, Zohreh
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (04) : 619 - 638
  • [7] Deep Separate Source-channel Coding for Semantic-aware Image Transmission
    Huang, Jianhao
    Li, Dongxu
    Huang, Chuan
    Qin, Xiaoqi
    Zhang, Wei
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5626 - 5631
  • [8] Semantic-aware blind image quality assessment
    Siahaan, Ernestasia
    Hanjalic, Alan
    Redi, Judith A.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 60 : 237 - 252
  • [9] Semantic-aware framework for Mobile Image Search
    Bouhlel, Noura
    Ksibi, Amel
    Ben Ammar, Anis
    Ben Amar, Chokri
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 479 - 484
  • [10] Semantic-aware Hashing for Social Image Retrieval
    Tang, Jinhui
    Li, Zechao
    Zhang, Liyan
    Huang, Qingming
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 483 - 486