Multi-core platform for an efficient H.264 and VC-1 video decoding based on macroblock row-level parallelism

被引:7
|
作者
Lee, J. -Y. [1 ,2 ]
Lee, J. -J. [1 ]
Park, S. M. [1 ]
机构
[1] Elect & Telecommun Res Inst, Taejon 305606, South Korea
[2] Univ Sci & Technol, Taejon, South Korea
关键词
Decoding - Video signal processing - Image coding - Scheduling - Software design;
D O I
10.1049/iet-cds.2009.0038
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In order for the video decoding processing such as H. 264 and VC-1 to be effective in multi-core environments, several kinds of parallelisms must be utilised. Here, a novel parallelisation methodology, macroblock row-level parallelism (MBRLP), of video decoding is presented. The ETRI multimedia processing core (EMC) and the ETRI multi-core platform (EMP) are proposed for adopting MBRLP. In terms of the scalability and utilisation of processing cores, MBRLP has advantages over other parallelisation strategies such as frame, slice and macroblock (MB)-level parallelism. The scalability can be easily achieved by just increasing the number of processing cores and applying homogeneous software design/optimisation techniques to each EMC. Instead of employing a dynamic MB-level scheduler, a hybrid approach is used, which is a two-stage functional pipelining combined with MBRLP. The hybrid approach of combining MBRLP and de-blocking pipelining can relieve the synchronisation and inter-processor communication overheads incurred by multicore decoding systems as well as run-time scheduler's overheads. As a result, the proposed parallelisation method and architectures can boost the performance with the efficiency of 83%. The proposed architecture consisting of six EMC clusters has the capability to process D1 (720 x 480) 30 fps real-time decoding at around 200 MHz. The same concept can be applied to full-HD (1920 x 1088) video decoding in this work. It can be found that as the number of processing cores increase, the performance improvement is enhanced almost linearly. The EMP consisting of four EMC clusters (eight cores), memories and other peripherals are prototyped on Xilinx Virtex4 XC4VL200 FPGA which is operating at 60 MHz.
引用
收藏
页码:147 / 158
页数:12
相关论文
共 43 条
  • [41] A Multi-Level Error Protection Technique Based on Macroblocks' Importance of the H.264/AVC Video Frames
    Huu Dung Pham
    Vafi, Sina
    2013 19TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): SMART COMMUNICATIONS TO ENHANCE THE QUALITY OF LIFE, 2013, : 465 - 470
  • [42] Efficient compression of multi-view video exploiting inter-view dependencies based on H.264/MPEG4-AVC
    Merkle, P.
    Mueller, K.
    Smolic, A.
    Wiegand, T.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1717 - +
  • [43] Memory-efficient table look-up optimized algorithm for context-based adaptive variable length decoding in H.264/advanced video coding
    Wang, Jianhua
    Cheng, Lianglun
    Wang, Tao
    Peng, Xiaodong
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (02)