Ali H. Sayed, Stefan Vlaski, Bicheng Ying, Kun Yuan
In empirical risk optimization, it has been observed that stochastic gradient implementations that rely on random reshuffling of the data achieve better performance than implementations that rely on sampling the data uniformly. Recent works have pursued ju ...
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC2019