Exceeding Conservative Limits: A Consolidated Analysis on Modern Hardware Margins

06/01/2020
by George Papadimitriou, et al.

Modern large-scale computing systems (data centers, supercomputers, cloud and edge setups, and high-end cyber-physical systems) employ heterogeneous architectures that consist of multicore CPUs, general-purpose many-core GPUs, and programmable FPGAs. The effective utilization of these architectures poses several challenges, a primary one being power consumption. Voltage reduction is one of the most efficient methods to reduce the power consumption of a chip. With the rapid adoption of hardware accelerators (i.e., GPUs and FPGAs) in large data centers and other large-scale computing infrastructures, a comprehensive evaluation of the safe voltage reduction levels for each chip can be used to reduce total power efficiently. We present a survey of recent studies on voltage margin reduction at the system level for modern CPUs, GPUs, and FPGAs. The pessimistic voltage guardbands inserted by the silicon vendors can be exploited in all devices for significant power savings. On average, voltage reduction can reach 12 20
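
To give a rough sense of why voltage reduction is attractive, the sketch below uses the textbook CMOS approximation that dynamic power scales with the square of the supply voltage at a fixed clock frequency. This model is an assumption brought in for illustration and is not taken from the survey; it ignores static (leakage) power, and the function name and the sample percentages are hypothetical.

```python
def dynamic_power_ratio(voltage_reduction: float) -> float:
    """Return undervolted dynamic power as a fraction of nominal power.

    Assumes P_dyn is proportional to V^2 at a fixed frequency
    (illustrative model only; leakage power is not modeled).
    `voltage_reduction` is a fraction, e.g. 0.10 for a 10% lower voltage.
    """
    if not 0.0 <= voltage_reduction < 1.0:
        raise ValueError("voltage_reduction must be in [0, 1)")
    return (1.0 - voltage_reduction) ** 2


if __name__ == "__main__":
    # Hypothetical reduction levels, chosen only to show the trend.
    for percent in (5, 10, 15):
        ratio = dynamic_power_ratio(percent / 100)
        saving = 100 * (1 - ratio)
        print(f"{percent}% lower voltage: roughly {saving:.1f}% lower dynamic power")
```

Under this simplified model the savings grow faster than the voltage reduction itself, which is why even a modest, safely exploitable guardband can matter at data-center scale. Measured savings on real chips depend on the workload and on the static power share.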


