-
Sparse Communication for Training Deep Networks
Synchronous stochastic gradient descent (SGD) is the most common method ...
read it

Negar Foroutan Eghlidi
is this you? claim profile
Synchronous stochastic gradient descent (SGD) is the most common method ...
read it