Innovative Insights: A Review of Deep Learning Methods for Enhanced Video Compression

Cited by: 0

Authors:

Khadir, Mohammad [1]

Farukh Hashmi, Mohammad [1]

Kotambkar, Deepali M. [2]

Gupta, Aditya [3]

Affiliations:

[1] Natl Inst Technol, Dept Elect & Commun Engn, Warangal 506004, India

[2] Ramdeobaba Univ, Dept Elect Engn, Nagpur 440013, India

[3] Univ Agder, Dept Informat & Commun Technol, N-4886 Grimstad, Norway

Source:

IEEE ACCESS | 2024 / Volume 12

Keywords:

Video compression; Video coding; Streaming media; Surveys; Transform coding; Deep learning; Reviews; Convolutional neural networks; Generative adversarial networks; Convolutional neural network; deep learning; deep neural network; generative adversarial network and video compression; NEURAL-NETWORKS; AUTO-ENCODER; CODEC; RECONSTRUCTION; DATABASE;

DOI:

10.1109/ACCESS.2024.3450814

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Video Compression (VC) is a significant aspect of multimedia technology, in which the goal is to minimize the size of video data while preserving its perceptual quality, for effective transmission and storage. Traditional approaches such as transform coding, predictive coding, and entropy coding are among the earliest approaches in this area. VC is a challenging problem that plays a significant role in the effective transmission of data with low storage and minimum bandwidth requirements. However, limited processing power, storage, and memory, along with lower compression rates and lower resolution, are some of the factors that impact the functionality and performance of VC. This survey presents a comprehensive review of current Deep Learning (DL) approaches for VC, especially the application of advanced DL-based Neural Network (NN) algorithms developed to address the aforementioned challenges of VC. The adaptability of DL algorithms is exploited to enhance the quality of compressed videos and to positively influence lossless video compression outcomes. The DL approaches include Deep Neural Network (DNN) methods such as Convolutional Neural Networks (CNN), Generative Adversarial Networks (GAN), Recurrent Neural Networks (RNN), Deep Recurrent Auto-Encoder (DRAE), etc. This survey examines the relationships, strengths, and problem statements of DL-based compression approaches for VC. Furthermore, it discusses datasets, hardware specifications, comparative analysis, and research directions. The survey incorporates DL-based computer vision approaches, combined with hardware accelerators such as GPU and FPGA, to minimize model complexity. It aims to address the limitations of VC, such as the varying effectiveness of specific encoder approaches, the challenges in utilizing hardware accelerators, low-resource devices, and difficulties in managing large-scale databases.
Integrating DL-based approaches with existing standard codecs remains a significant challenge. Ensuring compatibility, interoperability, and standardization is important for widespread adoption and integration. Enhancing the interpretability and control of DL approaches permits better customization of compression settings, allowing users to balance bit rate and quality according to their specific requirements. To gather relevant studies, widely used VC datasets such as the Ultra-Video-Group dataset (UVG), Video Trace Library (VTL), etc. are examined and utilized. The selection criteria for this study of VC techniques and DL approaches focus on the integration of DL with codecs, which is a primary research area of interest. This integration provides valuable insights into advanced DL applications for overcoming challenges associated with VC. Frameworks such as TensorFlow, Keras, and PyTorch are utilized to classify the approaches according to their fundamental NN architectures.


Pages: 125706-125725

Page count: 20
