research
∙
06/09/2021
Communication-efficient SGD: From Local SGD to One-Shot Averaging
We consider speeding up stochastic gradient descent (SGD) by parallelizi...
research
∙
06/03/2020