Variance-Reduced Stochastic Learning Under Random Reshuffling

Several useful variance-reduced stochastic gradient algorithms, such as SVRG, SAGA, Finito, and SAG, have been proposed to minimize empirical risks with linear convergence properties to the exact minimizer. The existing convergence results assume uniform data sampling with replacement. However, it has been observed in related works that random reshuffling can deliver superior performance over uniform sampling and, yet, no formal proofs or guarantees of exact convergence exist for variance-reduced algorithms under random reshuffling. This paper makes two contributions. First, it provides a theoretical guarantee of linear convergence under random reshuffling for SAGA in the mean-square sense; the argument is also adaptable to other variance-reduced algorithms. Second, under random reshuffling, the article proposes a new amortized variance-reduced gradient (AVRG) algorithm with constant storage requirements compared to SAGA and with balanced gradient computations compared to SVRG. AVRG is also shown analytically to converge linearly.

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Variance-Reduced Stochastic Learning Under Random Reshuffling

Graph Chatbot

Chattez avec Graph Search

Analytical Computation of the Sensitivity Coefficients in Hybrid AC/DC Networks

Statistical Inference for Inverse Problems: From Sparsity-Based Methods to Neural Networks

A Statistical Framework to Investigate the Optimality of Signal-Reconstruction Methods

Statistical Inference for Inverse Problems: From Sparsity-Based Methods to Neural Networks

Analytical Computation of the Sensitivity Coefficients in Hybrid AC/DC Networks

A Statistical Framework to Investigate the Optimality of Signal-Reconstruction Methods