Beyond Desktop Computation: Challenges in Scaling a GPU Infrastructure

10/11/2021
by   Martin Uray, et al.
17

Enterprises and labs performing computationally expensive data science applications sooner or later face the problem of scale but unconnected infrastructure. For this up-scaling process, an IT service provider can be hired or in-house personnel can attempt to implement a software stack. The first option can be quite expensive if it is just about connecting several machines. For the latter option often experience is missing with the data science staff in order to navigate through the software jungle. In this technical report, we illustrate the decision process towards an on-premises infrastructure, our implemented system architecture, and the transformation of the software stack towards a scaleable GPU cluster system.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset