Original paper

Parallel Restarted SGD with Faster Convergence and Less Communication: Demystifying Why Model Averaging Works for Deep Learning

Volume: 33, Issue: 01, Pages: 5693 - 5700
Published: Jul 17, 2019