CG-OoO: Energy-Efficient Coarse-Grain Out-of-Order Execution

06/06/2016
by Milad Mohammadi, et al.

We introduce the Coarse-Grain Out-of-Order (CG-OoO) general-purpose processor, designed to achieve close to In-Order processor energy while maintaining Out-of-Order (OoO) performance. CG-OoO is an energy-performance proportional general-purpose architecture that scales according to the program load. Block-level code processing is at the heart of this architecture; CG-OoO speculates, fetches, schedules, and commits code at block-level granularity. It eliminates unnecessary accesses to energy-consuming tables, and turns large tables into smaller, distributed tables that are cheaper to access. CG-OoO leverages compiler-level code optimizations to deliver efficient static code, and exploits dynamic instruction-level parallelism and block-level parallelism. CG-OoO introduces Skipahead issue, a complexity-effective, limited out-of-order instruction scheduling model. Through the energy efficiency techniques applied to the compiler and processor pipeline stages, CG-OoO closes 64% of the average energy gap between the In-Order and Out-of-Order baseline processors at the performance of the OoO baseline. This makes CG-OoO 1.9x more efficient than the OoO on the energy-delay product inverse metric.
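
As a reading aid, the energy-delay product inverse metric quoted above can be written out as follows. This is a sketch based on the common definition of energy-delay product; the symbols E (energy consumed) and D (delay, i.e. execution time) are assumptions introduced here and are not notation taken from the abstract.

\[
\mathrm{EDP}^{-1} = \frac{1}{E \cdot D},
\qquad
\frac{\mathrm{EDP}^{-1}_{\text{CG-OoO}}}{\mathrm{EDP}^{-1}_{\text{OoO}}}
= \frac{E_{\text{OoO}} \, D_{\text{OoO}}}{E_{\text{CG-OoO}} \, D_{\text{CG-OoO}}}
\approx 1.9
\]

Under the idealized assumption that both designs have exactly equal delay, the ratio reduces to E_OoO / E_CG-OoO ≈ 1.9, meaning CG-OoO would consume roughly 1/1.9 ≈ 53% of the OoO energy. The measured delays need not be exactly equal, so this is only an approximate reading of the reported figure.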


