On the impact of learning-based image compression on computer vision tasks

被引:0
|
作者
Akamatsu, Shunsuke [1 ]
Testolina, Michela [2 ]
Upenik, Evgeniy [2 ]
Ebrahimi, Touradj [2 ]
机构
[1] Waseda Univ, Adv Multimedia Syst Lab, Shillman Hall 401,3-14-9 Okubo,Shinjuku Ku, Tokyo 1690072, Japan
[2] Ecole Polytech Fed Lausanne EPFL, Multimedia Signal Proc Grp MMSPG, CH-1015 Lausanne, Switzerland
关键词
JPEG AI; learning-based image compression; computer vision; image classification; object detection;
D O I
10.1117/12.3030885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The image compression field is witnessing a shift in paradigm thanks to the rise of neural network-based models. In this context, the JPEG committee is in the process of standardizing the first learning-based image compression standard, known as JPEG AI. While most of the research to date has focused on image coding for humans, JPEG AI plans to address both human and machine vision by presenting several non-normative decoders addressing multiple image processing and computer vision tasks in addition to standard reconstruction. While the impact of conventional image compression on computer vision tasks has already been addressed, no study has been conducted to assess the impact of learning-based image compression on such tasks. In this paper, the impact of learning-based image compression, including JPEG AI, on computer vision tasks is reviewed and discussed, mainly focusing on the image classification task along with object detection and segmentation. This study reviews the impact of image compression with JPEG AI on various computer vision models. It shows the superiority of JPEG AI over other conventional and learning-based compression models.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Learning-based Visual Compression
    Ji, Ruolei
    Karam, Lina J.
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2023, 15 (01): : 1 - 112
  • [22] Heuristic particle swarm optimization learning-based image compression system
    Feng, Hsuan-Ming
    Chen, Ching-Yi
    Ye, Fun
    CYBERNETICS AND SYSTEMS, 2008, 39 (05) : 520 - 537
  • [23] End-to-End Learning-Based Image Compression With a Decoupled Framework
    Zhang, Zhaobin
    Esenlik, Semih
    Wu, Yaojun
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081
  • [24] MULTI-MODE INTRA PREDICTION FOR LEARNING-BASED IMAGE COMPRESSION
    Jung, Henrique Costa
    Guerin Jr, Nilson Donizete
    Ramos, Raphael Soares
    Macchiavello, Bruno
    Peixoto, Eduardo
    Hung, Edson Mintsu
    de Campos, Teofilo
    da Silva, Renam Castro
    Testoni, Vanessa
    Freitas, Pedro Garcia
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1296 - 1300
  • [25] Review of computer vision application in in vitro fertilization: the application of deep learning-based computer vision technology in the world of IVF
    Claudio Michael Louis
    Alva Erwin
    Nining Handayani
    Arie A. Polim
    Arief Boediono
    Ivan Sini
    Journal of Assisted Reproduction and Genetics, 2021, 38 : 1627 - 1639
  • [26] Review of computer vision application in in vitro fertilization: the application of deep learning-based computer vision technology in the world of IVF
    Louis, Claudio Michael
    Erwin, Alva
    Handayani, Nining
    Polim, Arie A.
    Boediono, Arief
    Sini, Ivan
    JOURNAL OF ASSISTED REPRODUCTION AND GENETICS, 2021, 38 (07) : 1627 - 1639
  • [27] Experimental Evaluation of Computer Vision and Machine Learning-Based UAV Detection and Ranging
    Wei, Bingsheng
    Barczyk, Martin
    DRONES, 2021, 5 (02)
  • [28] Introduction to Computer Vision and Real Time Deep Learning-based Object Detection
    Shanahan, James G.
    Dai, Liang
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3523 - 3524
  • [29] Construction Instance Segmentation (CIS) Dataset for Deep Learning-Based Computer Vision
    Yan, Xuzhong
    Zhang, Hong
    Wu, Yefei
    Lin, Chen
    Liu, Shengwei
    AUTOMATION IN CONSTRUCTION, 2023, 156
  • [30] Introduction to Computer Vision and Real Time Deep Learning-based Object Detection
    Shanahan, James G.
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3515 - 3516