Dynamic Instruction Scheduling in a Trace-based Multi-threaded Architecture

Authors
Peter A. Rounce
Alberto F. De Souza
Affiliations
[1] University College London, Department of Computer Science
[2] Universidade Federal do Espírito Santo, Departamento de Informática
Keywords
Simultaneous multi-threading; Dynamic instruction scheduling; Wide issue architectures; VLIW
DOI
Not available
Abstract
Simulation results are presented for a design that uses the hardware-implemented, trace-based dynamic instruction scheduler of our single-process DTSVLIW architecture to schedule instructions from several processes into multiple streams of VLIW instructions for execution by a wide-issue, simultaneous multi-threading (SMT) execution engine. Scheduling involves single-instruction execution of each process, with the executed instructions dynamically scheduled into blocks of VLIW instructions that are cached for subsequent SMT execution. SMT provides a mechanism to reduce the impact of horizontal and vertical waste, and of variable memory latencies, seen in the DTSVLIW. Preliminary experiments explore this extended model. Results show PE utilization of up to 87% on a 4-thread, 1-scalar, 8-PE design, with speed-ups of up to 6.3 times over a single processor. Notably, only a single scalar process needs to be scheduled at any time, and main memory fetches are 1–4% of those of a single processor.
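The utilization and waste terms in the abstract can be illustrated with a minimal sketch. It assumes the definitions commonly used in the SMT literature: vertical waste is the issue slots lost in cycles that issue nothing, and horizontal waste is the unused slots in cycles that issue at least one operation. The function name `slot_usage`, the `SlotUsage` type, and the sample trace are hypothetical and are not taken from the paper or its simulator.

```python
from typing import List, NamedTuple


class SlotUsage(NamedTuple):
    utilization: float       # fraction of issue slots that did useful work
    horizontal_waste: float  # unused slots in cycles that issued something
    vertical_waste: float    # slots lost in cycles that issued nothing


def slot_usage(issued_per_cycle: List[int], width: int) -> SlotUsage:
    """Classify issue slots for a trace of per-cycle issue counts.

    issued_per_cycle[i] is the number of operations issued in cycle i;
    width is the number of issue slots (PEs) available per cycle.
    """
    if width <= 0 or not issued_per_cycle:
        raise ValueError("need a positive width and a non-empty trace")

    total_slots = width * len(issued_per_cycle)
    used = horizontal = vertical = 0
    for issued in issued_per_cycle:
        if not 0 <= issued <= width:
            raise ValueError("issue count must lie between 0 and width")
        used += issued
        if issued == 0:
            vertical += width             # whole cycle lost
        else:
            horizontal += width - issued  # partly filled cycle

    return SlotUsage(used / total_slots,
                     horizontal / total_slots,
                     vertical / total_slots)


# Made-up 8-cycle trace on an 8-slot machine (illustrative data only).
usage = slot_usage([8, 6, 0, 7, 8, 0, 7, 8], width=8)
print(usage)
```

The three fractions always sum to 1, so any reduction in horizontal or vertical waste shows up directly as higher utilization.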
Pages: 184 - 205
Page count: 21
Related papers
50 in total
  • [1] Dynamic instruction scheduling in a trace-based multi-threaded architecture
    Rounce, Peter A.
    De Souza, Alberto F.
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2008, 36 (02) : 184 - 205
  • [2] The mDTSVLIW: a multi-threaded trace-based VLIW architecture
    Rounce, P.
    De Souza, A. F.
    SBAC-PAD 2006: 18TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, 2006, : 63 - 70
  • [3] Global multi-threaded instruction scheduling
    Ottoni, Guilherme
    August, David I.
    MICRO-40: PROCEEDINGS OF THE 40TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2007, : 56 - 68
  • [4] Communication optimizations for global multi-threaded instruction scheduling
    Ottoni, Guilherme
    August, David I.
    ACM SIGPLAN NOTICES, 2008, 43 (03) : 222 - 232
  • [5] Trace-based runtime instruction rescheduling for architecture extension
    Tang, YX
    Deng, K
    Cao, HJ
    Zhou, XM
    EMBEDDED SOFTWARE AND SYSTEMS, PROCEEDINGS, 2005, 3820 : 4 - 15
  • [6] On-line multi-threaded scheduling
    Feuerstein, E
    Mydlarz, M
    Stougie, L
    JOURNAL OF SCHEDULING, 2003, 6 (02) : 167 - 181
  • [7] On-line Multi-threaded Scheduling
    Esteban Feuerstein
    Marcelo Mydlarz
    Leen Stougie
    Journal of Scheduling, 2003, 6 : 167 - 181
  • [8] A reconfigurable multi-threaded architecture model
    Wallner, S
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, 2003, 2823 : 193 - 207
  • [9] Tail queues: A multi-threaded matching architecture
    Dosanjh, Matthew G. F.
    Grant, Ryan E.
    Schonbein, Whit
    Bridges, Patrick G.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (03):
  • [10] The 'uniform heterogeneous multi-threaded' processor architecture
    Towner, D
    May, D
    COMMUNICATING PROCESS ARCHITECTURES 2001, 2001, 59 : 103 - 116