On the Trade-off between Flatness and Optimization in Distributed Learning