Semantic-Aware Visual Decomposition for Image Coding

被引:3
|
作者
Chang, Jianhui [1 ]
Zhang, Jian [2 ]
Li, Jiguo [3 ]
Wang, Shiqi [4 ]
Mao, Qi [5 ]
Jia, Chuanmin [1 ]
Ma, Siwei [1 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Visual Technol, Sch Comp Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
基金
中国国家自然科学基金;
关键词
Image coding; Semantic-aware visual decomposition; Structure-texture; Coherency regularization; Extremely low bitrate;
D O I
10.1007/s11263-023-01809-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel image coding framework with semantic-aware visual decomposition towards extremely low bitrate compression. In particular, an input image is analyzed into a semantic map as structural representation and semantic-wise texture representation and further compressed into bitstreams at the encoder side. On the decoder side, the received bitstreams of dual-layer representations are decoded and reconstructed for target image synthesis with generative models. Moreover, the attention mechanism is introduced into the model architecture for texture representation modeling and a coherency regularization is proposed to further optimize the texture representation space by aligning the representation space with the source pixel space for higher synthesis quality. Besides, we also propose a cross-channel entropy module and control the quantization scale to facilitate rate-distortion optimization. Upon compressing the decomposed components into the bitstream, the simple yet effective representation philosophy benefits image compression in many aspects. First, in terms of compression performance, compact representations, and high visual synthesis quality can bring remarkable advantages. Second, the proposed framework yields a physically explainable bitstream composed of the structural segment and semantic-wise texture segments. Third and most importantly, subsequent vision tasks (e.g., content manipulation) can receive fundamental support from the semantic-aware visual decomposition and synthesis mechanism. Extensive experimental results demonstrate the superiority of the proposed framework towards efficient visual representation learning, high efficiency image compression (< 0.1 bpp), and intelligent visual applications (e.g., manipulation and analysis).
引用
收藏
页码:2333 / 2355
页数:23
相关论文
共 50 条
  • [31] Semantic-Aware Data Augmentation for Text-to-Image Synthesis
    Tan, Zhaorui
    Yang, Xi
    Huang, Kaizhu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5098 - 5107
  • [32] Semantic-Aware Global and Local Fusion Model for Image Enhancement
    Chen, Mengjie
    Xu, Hao
    Lv, Pei
    PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 27 - 41
  • [33] Multistage semantic-aware image inpainting with stacked generator networks
    Ren, Yongpeng
    Ren, Hongping
    Shi, Canghong
    Zhang, Xian
    Wu, Xi
    Li, Xiaojie
    Lv, Jiancheng
    Zhou, Jiliu
    Mumtaz, Imran
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (02) : 1599 - 1617
  • [34] SEMANTIC-AWARE UNPAIRED IMAGE-TO-IMAGE TRANSLATION FOR URBAN SCENE IMAGES
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2150 - 2154
  • [35] Semantic-aware expert partitioning
    1600, Springer Verlag (8722):
  • [36] Semantic-Aware Makeup Cleanser
    Li, Yi
    Huang, Huaibo
    Yu, Junchi
    He, Ran
    Tan, Tieniu
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON BIOMETRICS THEORY, APPLICATIONS AND SYSTEMS (BTAS), 2019,
  • [37] Semantic-Aware Expert Partitioning
    Boeva, Veselka
    Boneva, Lilyana
    Tsiporkova, Elena
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, 2014, 8722 : 13 - 24
  • [38] Semantic-aware Iot platforms
    Ben Alaya, Mahdi
    Drira, Khalil
    Gharbi, Ghada
    2017 IEEE 6TH INTERNATIONAL CONFERENCE ON AI & MOBILE SERVICES (AIMS), 2017, : 8 - 13
  • [39] Semantic-aware scene recognition
    Lopez-Cifuentes, Alejandro
    Escudero-Vinolo, Marcos
    Bescos, Jesus
    Garcia-Martin, Alvaro
    PATTERN RECOGNITION, 2020, 102
  • [40] Semantic-Aware Vulnerability Detection
    Huang, Zhen
    White, Marc
    2022 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE (IEEE CSR), 2022, : 68 - 75