Convergence Of Variance-Reduced Learning Under Random Reshuffling

Several useful variance-reduced stochastic gradient algorithms, such as SVRG, SAGA, Finito, and SAG, have been proposed to minimize empirical risks with linear convergence properties to the exact minimizers. The existing convergence results assume uniform data sampling with replacement. However, it has been observed that random reshuffling can deliver superior performance and, yet, no formal proofs or guarantees of exact convergence exist for variance-reduced algorithms under random reshuffling. This paper makes two contributions. First, it resolves this open issue and provides the first theoretical guarantee of linear convergence under random reshuffling for SAGA; the argument is also adaptable to other variance-reduced algorithms. Second, under random reshuffling, the paper proposes a new amortized variance-reduced gradient (AVRG) algorithm with constant storage requirements compared to SAGA and with balanced gradient computations compared to SVRG. AVRG is also shown analytically to converge linearly.

Convergence Of Variance-Reduced Learning Under Random Reshuffling

Graph Chatbot

Chat with Graph Search

Analytical Computation of the Sensitivity Coefficients in Hybrid AC/DC Networks

SC-TPTP: An Extension of the TPTP Derivation Format for Sequent-Based Calculus

Generalization of Scaled Deep ResNets in the Mean-Field Regime

Analytical Computation of the Sensitivity Coefficients in Hybrid AC/DC Networks

Generalization of Scaled Deep ResNets in the Mean-Field Regime

SC-TPTP: An Extension of the TPTP Derivation Format for Sequent-Based Calculus