Refining instruction set architecture for high-performance multimedia processing in constrained environments

Cited by: 6
Authors
Lee, RB [1]
Fiskiran, AM [1]
Shi, ZJ [1]
Yang, M [1]
Affiliation
[1] Princeton Univ, Dept Elect Engn, PALMS, Princeton, NJ 08544 USA
DOI
10.1109/ASAP.2002.1030724
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Multimedia processing in software has been significantly accelerated by the addition of subword-parallel instructions to the instruction set architectures (ISAs) of modern microprocessors. While some of these multimedia instructions are simple and effective, others are very complex, requiring large, special-purpose functional units that are not practical for constrained environments such as handheld multimedia information appliances. For such environments, low power and low cost are as important as the high performance required for real-time multimedia processing and the general-purpose programmability required to support an ever-growing range of applications. In this paper, we introduce PLX, a concise ISA that selects the most useful features from the first two generations of multimedia instructions added to microprocessors, and explores new ISA features for high-performance yet low-cost multimedia processing with small-footprint processors. PLX is unique in that it is designed from scratch as a fully subword-parallel architecture with novel features like datapath scalability from 32-bit to 128-bit words, and a new definition of predication for reducing conditional branches. We illustrate the use of PLX's architectural features with four frequently used multimedia kernels: discrete cosine transform, pixel padding, clip test, and median filter. Our performance results show that a 64-bit PLX implementation achieves significant speedups compared to a basic 64-bit RISC processor and to IA-32 processors with MMX and SSE multimedia extensions. PLX's datapath scalability feature often provides an additional 2x speedup in a cost-effective way.
Pages: 253-264
Page count: 12