Refining instruction set architecture for high-performance multimedia processing in constrained environments

被引:6
|
作者
Lee, RB [1 ]
Fiskiran, AM [1 ]
Shi, ZJ [1 ]
Yang, M [1 ]
机构
[1] Princeton Univ, Dept Elect Engn, PALMS, Princeton, NJ 08544 USA
关键词
D O I
10.1109/ASAP.2002.1030724
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multimedia processing in software has been significantly accelerated by the addition of subword-parallel instructions to the instruction set architectures (ISAs) of modern microprocessors. While some of these multimedia instructions are simple and effective, others are very complex, requiring large, special-purpose functional units that are not practical for constrained environments such as handheld multimedia information appliances. For such environments, low-power and low-cost are as important as the high performance required for real-time multimedia processing and the general-purpose programmability required to support an ever growing range of applications. In this paper, we introduce PLX, a concise ISA that selects the most useful features from the first two generations of multimedia instructions added to microprocessors, and explores new ISA features for high-performance yet low-cost multimedia processing with small footprint processors. PLX is unique in that it is designed from scratch as a fully subword-parallel architecture with novel features like datapath scalability from 32-bit to 128-bit words, and a new definition of predication for reducing conditional branches. We illustrate the use of PLX's architectural features with four frequently used multimedia kernels: discrete cosine transform, pixel padding, clip test and median filter. Our performance results show that a 64-bit PLX implementation achieves significant speedups compared to a basic 64-bit RISC processor and to IA-32 processors with MMX and SSE multimedia extensions. PLX's datapath scalability feature often provides an additional 2x speedup in a cost-effective way.
引用
收藏
页码:253 / 264
页数:12
相关论文
共 50 条
  • [41] Algorithm-based low-power and high-performance multimedia signal processing
    Univ of Maryland, College Park, United States
    Proc IEEE, 6 (1155-1202):
  • [42] HIGH-PERFORMANCE ARCHITECTURE ISSUES
    NICOLE, DA
    DECENTRALIZED AND DISTRIBUTED SYSTEMS, 1993, 39 : 23 - 30
  • [43] HIGH-PERFORMANCE COMPUTER ARCHITECTURE
    BHUYAN, LN
    FUTURE GENERATION COMPUTER SYSTEMS, 1995, 11 (06) : 501 - 502
  • [44] Natural instruction level parallelism-aware compiler for high-performance QueueCore processor architecture
    Ben Abdallah Abderazek
    Masashi Masuda
    Arquimedes Canedo
    Kenichi Kuroda
    The Journal of Supercomputing, 2011, 57 : 314 - 338
  • [45] Natural instruction level parallelism-aware compiler for high-performance QueueCore processor architecture
    Abderazek, Ben Abdallah
    Masuda, Masashi
    Canedo, Arquimedes
    Kuroda, Kenichi
    JOURNAL OF SUPERCOMPUTING, 2011, 57 (03): : 314 - 338
  • [46] A high-performance multi-purpose DSP architecture for signal processing research
    Morrison, SA
    Parks, JS
    Gugel, KS
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 601 - 604
  • [47] Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture
    Qingxiang Chen
    Long Zheng
    Xiaofei Liao
    Hai Jin
    Qinggang Wang
    CCF Transactions on High Performance Computing, 2020, 2 : 362 - 375
  • [48] Merging VLIW and vector processing techniques for a simple, high-performance processor architecture
    Soliman, Mostafa I.
    MICROELECTRONICS JOURNAL, 2015, 46 (07) : 637 - 655
  • [49] High-performance FPGA Architecture for Data Streams Processing on Example of IPsec Gateway
    Korona, Mateusz
    Skowron, Krzysztof
    Trzepinski, Mateusz
    Rawski, Mariusz
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2018, 64 (03) : 351 - 356
  • [50] Software architecture exploration for high-performance security processing on a multiprocessor mobile SoC
    Arora, Divya
    Raghunathan, Anand
    Ravi, Srivaths
    Sankaradass, Murugan
    Jha, Niraj K.
    Chakradhar, Srimat T.
    43RD DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2006, 2006, : 496 - +