Refining instruction set architecture for high-performance multimedia processing in constrained environments

Cited by: 6
Authors
Lee, RB [1]
Fiskiran, AM [1]
Shi, ZJ [1]
Yang, M [1]
Affiliations
[1] Princeton Univ, Dept Elect Engn, PALMS, Princeton, NJ 08544 USA
DOI
10.1109/ASAP.2002.1030724
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Multimedia processing in software has been significantly accelerated by the addition of subword-parallel instructions to the instruction set architectures (ISAs) of modern microprocessors. While some of these multimedia instructions are simple and effective, others are very complex, requiring large, special-purpose functional units that are not practical for constrained environments such as handheld multimedia information appliances. For such environments, low power and low cost are as important as the high performance required for real-time multimedia processing and the general-purpose programmability required to support an ever-growing range of applications. In this paper, we introduce PLX, a concise ISA that selects the most useful features from the first two generations of multimedia instructions added to microprocessors, and explores new ISA features for high-performance yet low-cost multimedia processing with small-footprint processors. PLX is unique in that it is designed from scratch as a fully subword-parallel architecture with novel features like datapath scalability from 32-bit to 128-bit words, and a new definition of predication for reducing conditional branches. We illustrate the use of PLX's architectural features with four frequently used multimedia kernels: discrete cosine transform, pixel padding, clip test, and median filter. Our performance results show that a 64-bit PLX implementation achieves significant speedups compared to a basic 64-bit RISC processor and to IA-32 processors with MMX and SSE multimedia extensions. PLX's datapath scalability feature often provides an additional 2x speedup in a cost-effective way.
Pages: 253-264
Page count: 12