Hybrid model-and-object-based real-time conversational video coding

Cited by: 3
Authors
Li, Yang [1]
Tao, Xiaoming [1]
Lu, Jianhua [1]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Model-based video coding; Object-based video coding; Face modeling; FACIAL ANIMATION; IMAGE; COMPRESSION; SEQUENCES; TRACKING
DOI
10.1016/j.image.2015.03.009
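Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809
Abstract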
Bandwidth-constrained real-time conversational video communications (such as mobile teleconferencing) require video codecs with good rate-distortion characteristics at low bit-rates and modest computational complexity. While target-specific object-based and model-based coding methods have been proposed for low bit-rate conversational video coding, difficulties in generalization and high computational complexity hinder their practical utilization. In this paper, we propose a low bit-rate coding method for typical conversational video by combining two-dimensional model-based coding of face regions and object-based coding of non-face head-shoulder regions, achieving high-quality face reconstruction and low overall bit-rate with real-time encoding capability. Experiments on typical conversational test sequences confirm that, compared to other conversational video codecs, our model-and-object-based coding method offers superior rate-distortion performance at low bit-rates. (C) 2015 Elsevier B.V. All rights reserved.
Pages: 9-19
Page count: 11
Related papers
50 in total
  • [31] Real-Time Constant Objective Quality Video Coding Strategy in High Efficiency Video Coding
    Cai, Qi
    Chen, Zhifeng
    Wu, Dapeng Oliver
    Huang, Bo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2215 - 2228
  • [32] iHELP: a model for instant learning of video coding in VR/AR real-time applications
    Sharrab, Yousef O.
    Alsmirat, Mohammad A.
    Eljinini, Mohammad Ali H.
    Sarhan, Nabil J.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (33) : 79397 - 79436
  • [33] High-accuracy background model for real-time video foreground object detection
    Tsai, Wen-Kai
    Lin, Chung-Chi
    Sheu, Ming-Hwa
    OPTICAL ENGINEERING, 2012, 51 (02)
  • [34] Model of object-based coding for surveillance video
    Yu, Y
    Doermann, D
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 693 - 696
  • [35] Object detection in real-time video surveillance using attention based transformer-YOLOv8 model
    Nimma, Divya
    Al-Omari, Omaia
    Pradhan, Rahul
    Ulmas, Zoirov
    Krishna, R. V. V.
    El-Ebiary, Ts. Yousef A. Baker
    Rao, Vuda Sreenivasa
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 118 : 482 - 495
  • [36] A unified architecture for real-time video-coding systems
    Li, ZG
    Zhu, C
    Ling, N
    Yang, XK
    Feng, GN
    Wu, S
    Pan, F
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (06) : 472 - 487
  • [37] An efficient video coding control algorithm for real-time implementation
    Hsia, SC
    Chen, CL
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 110 - 113
  • [38] On-the-Fly Erasure Coding for Real-Time Video Applications
    Tournoux, Pierre Ugo
    Lochin, Emmanuel
    Lacan, Jerome
    Bouabdallah, Amine
    Roca, Vincent
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) : 797 - 812
  • [39] Multiprocessor Performance for Real-Time Processing of Video Coding Applications
    Jeschke, Hartwig
    Gaedke, Klaus
    Pirsch, Peter
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1992, 2 (02) : 221 - 230
  • [40] A real-time N-descriptions video coding architecture
    Franchi, N
    Fumagalli, M
    Lancini, R
    VISUAL CONTENT PROCESSING AND REPRESENTATION, PROCEEDINGS, 2003, 2849 : 267 - 274