Hybrid model-and-object-based real-time conversational video coding

被引：3

作者：

Li, Yang ^{[1
]}

Tao, Xiaoming ^{[1
]}

Lu, Jianhua ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2015年 / 35卷

基金：

中国国家自然科学基金;

关键词：

Model-based video coding; Object-based video coding; Face mogdeling; FACIAL ANIMATION; IMAGE; COMPRESSION; SEQUENCES; TRACKING;

D O I：

10.1016/j.image.2015.03.009

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Bandwidth-constrained real-time conversational video communications (such as mobile teleconferencing) require video codecs with good rate-distortion characteristics at low bit-rates and modest computational complexity. While target-specific object-based and model-based coding methods have been proposed for low bit-rate conversational video coding, difficulties in generalization and high computational complexity hinder their practical utilization. In this paper, we propose a low bit-rate coding method for typical conversational video by combining two-dimensional model-based coding of face regions and object-based coding of non-face head-shoulder regions, achieving high-quality face reconstruction and low overall bit-rate with real-time encoding capability. Experiments on typical conversational test sequences confirm that, compared to other conversational video codecs, our model-and-object-based coding method offers superior rate-distortion performance at low bit-rates. (C) 2015 Elsevier B.V. All rights reserved.

引用

页码：9 / 19

页数：11

共 50 条

[31] Real-Time Constant Objective Quality Video Coding Strategy in High Efficiency Video Coding
Cai, Qi
Chen, Zhifeng
Wu, Dapeng Oliver
Huang, Bo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2215 - 2228
[32] iHELP: a model for instant learning of video coding in VR/AR real-time applications
Sharrab, Yousef O.
Alsmirat, Mohammad A.
Eljinini, Mohammad Ali H.
Sarhan, Nabil J.
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (33) : 79397 - 79436
[33] High-accuracy background model for real-time video foreground object detection
Tsai, Wen-Kai
Lin, Chung-Chi
Sheu, Ming-Hwa
OPTICAL ENGINEERING, 2012, 51 (02)
[34] Model of object-based coding for surveillance video
Yu, Y
Doermann, D
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 693 - 696
[35] Object detection in real-time video surveillance using attention based transformer-YOLOv8 model
Nimma, Divya
Al-Omari, Omaia
Pradhan, Rahul
Ulmas, Zoirov
Krishna, R. V. V.
El-Ebiary, Ts. Yousef A. Baker
Rao, Vuda Sreenivasa
ALEXANDRIA ENGINEERING JOURNAL, 2025, 118 : 482 - 495
[36] A unified architecture for real-time video-coding systems
Li, ZG
Zhu, C
Ling, N
Yang, XK
Feng, GN
Wu, S
Pan, F
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (06) : 472 - 487
[37] An efficient video coding control algorithm for real-time implementation
Hsia, SC
Chen, CL
6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 110 - 113
[38] On-the-Fly Erasure Coding for Real-Time Video Applications
Tournoux, Pierre Ugo
Lochin, Emmanuel
Lacan, Jerome
Bouabdallah, Amine
Roca, Vincent
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) : 797 - 812
[39] Multiprocessor Performance for Real-Time Processing of Video Coding Applications
Jeschke, Hartwig
Gaedke, Klaus
Pirsch, Peter
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1992, 2 (02) : 221 - 230
[40] A real-time N-descriptions video coding architecture
Franchi, N
Fumagalli, M
Lancini, R
VISUAL CONTENT PROCESSING AND REPRESENTATION, PROCEEDINGS, 2003, 2849 : 267 - 274

← 1 2 3 4 5 →