This work studies post-training parameter quantization in large language...
We study the role of magnitude structured pruning as an architecture sea...
We introduce a principled approach to neural network pruning that casts ...
Low-precision arithmetic trains deep learning models using less energy, ...
Convergence detection of iterative stochastic optimization methods is of...
Iterative procedures in stochastic optimization are typically comprised ...