Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor

07/12/2017
by   Chansup Byun, et al.
0

Knights Landing (KNL) is the code name for the second-generation Intel Xeon Phi product family. KNL has generated significant interest in the data analysis and machine learning communities because its new many-core architecture targets both of these workloads. The KNL many-core vector processor design enables it to exploit much higher levels of parallelism. At the Lincoln Laboratory Supercomputing Center (LLSC), the majority of users are running data analysis applications such as MATLAB and Octave. More recently, machine learning applications, such as the UC Berkeley Caffe deep learning framework, have become increasingly important to LLSC users. Thus, the performance of these applications on KNL systems is of high interest to LLSC users and the broader data analysis and machine learning communities. Our data analysis benchmarks of these application on the Intel KNL processor indicate that single-core double-precision generalized matrix multiply (DGEMM) performance on KNL systems has improved by 3.5x compared to prior Intel Xeon technologies. Our data analysis applications also achieved 60 Also a performance comparison of a machine learning application, Caffe, between the two different Intel CPUs, Xeon E5 v3 and Xeon Phi 7210, demonstrated a 2.7x improvement on a KNL node.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2019

Simulating Nonlinear Neutrino Oscillations on Next-Generation Many-Core Architectures

In this work an astrophysical simulation code, XFLAT, is developed to st...
research
09/24/2020

Investigating Applications on the A64FX

The A64FX processor from Fujitsu, being designed for computational simul...
research
07/06/2019

Optimizing Xeon Phi for Interactive Data Analysis

The Intel Xeon Phi manycore processor is designed to provide high perfor...
research
04/03/2018

Vanlearning: A Machine Learning SaaS Application for People Without Programming Backgrounds

Although we have tons of machine learning tools to analyze data, most of...
research
07/20/2018

Interactive Supercomputing on 40,000 Cores for Machine Learning and Data Analysis

Interactive massively parallel computations are critical for machine lea...
research
10/10/2018

Performance analysis and optimization of the JOREK code for many-core CPUs

This report investigates the performance of the JOREK code on the Intel ...
research
02/23/2017

First Experiences Optimizing Smith-Waterman on Intel's Knights Landing Processor

The well-known Smith-Waterman (SW) algorithm is the most commonly used m...

Please sign up or login with your details

Forgot password? Click here to reset