Science Cast

Stability and Generalization for Minibatch SGD and Local SGD

Yunwen LeiOctober 7, 2023 7:32pm

Views (44)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Stability and Generalization for Minibatch SGD and Local SGD

arXivPDFOctober 2, 2023 12:00am

Authors

Yunwen Lei, Tao Sun, Mingrui Liu

Abstract

The increasing scale of data propels the popularity of leveraging parallelism to speed up the optimization. Minibatch stochastic gradient descent (minibatch SGD) and local SGD are two popular methods for parallel optimization. The existing theoretical studies show a linear speedup of these methods with respect to the number of machines, which, however, is measured by optimization errors. As a comparison, the stability and generalization of these methods are much less studied. In this paper, we pioneer the stability and generalization analysis of minibatch and local SGD to understand their learnability. We incorporate training errors into the stability analysis, which shows how small training errors help generalization for overparameterized models. Our stability bounds imply optimistic risk bounds which decay fast under a low noise condition. We show both minibatch and local SGD achieve a linear speedup to attain the optimal risk bounds.

TwitterandLinkedIn

0 comments

Add comment

Stability and Generalization for Minibatch SGD and Local SGD

Stability and Generalization for Minibatch SGD and Local SGD

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments