Publication

Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function

Publications associées (61)

Graph Chatbot

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Connectez-vous pour utiliser Chat avec Graph Search

Online Adaptive Methods, Universality and Acceleration

Volkan Cevher, Alp Yurtsever

We present a novel method for convex unconstrained optimization that, without any modifications, ensures: (i) accelerated convergence rate for smooth objectives, (ii) standard convergence rate in the general (non-smooth) setting, and (iii) standard converg ...

2018

On the linear convergence of the stochastic gradient method with constant step-size

Volkan Cevher, Cong Bang Vu

The strong growth condition (SGC) is known to be a sufficient condition for linear convergence of the stochastic gradient method using a constant step-size γ (SGM-CS). In this paper, we provide a necessary condition, for the linear convergence of SGM-CS, t ...

2018

Online Adaptive Methods, Universality and Acceleration

Volkan Cevher, Alp Yurtsever

NEURAL INFORMATION PROCESSING SYSTEMS (NIPS)2018

Smooth Primal-Dual Coordinate Descent Algorithms for Nonsmooth Convex Optimization

Volkan Cevher, Quoc Tran Dinh, Ahmet Alacaoglu

We propose a new randomized coordinate descent method for a convex optimization template with broad applications. Our analysis relies on a novel combination of four ideas applied to the primal-dual gap function: smoothing, acceleration, homotopy, and coord ...

2017

Stochastic gradient descent with finite samples sizes

Ali H. Sayed, Stefan Vlaski, Bicheng Ying, Kun Yuan

The minimization of empirical risks over finite sample sizes is an important problem in large-scale machine learning. A variety of algorithms has been proposed in the literature to alleviate the computational burden per iteration at the expense of converge ...

IEEE2016

Adaptive data augmentation for image classification

Pascal Frossard, Alhussein Fawzi

Data augmentation is the process of generating samples by transforming training data, with the target of improving the accuracy and robustness of classifiers. In this paper, we propose a new automatic and adaptive algorithm for choosing the transformations ...

IEEE2016

Theory of representation learning in cortical neural networks

Carlos Stein Naves de Brito

Our brain continuously self-organizes to construct and maintain an internal representation of the world based on the information arriving through sensory stimuli. Remarkably, cortical areas related to different sensory modalities appear to share the same f ...

EPFL2016

Stochastic Spectral Descent for Discrete Graphical Models

Volkan Cevher, Ya-Ping Hsieh, Edo Collins

Interest in deep probabilistic graphical models has increased in recent years, due to their state-of-the-art perfor- mance on many machine learning applications. Such models are typically trained with the stochastic gradient method, which can take a signif ...

Ieee-Inst Electrical Electronics Engineers Inc2016

Stochastic Spectral Descent for Restricted Boltzmann Machines.

Volkan Cevher

Restricted Boltzmann Machines (RBMs) are widely used as building blocks for deep learning models. Learning typically proceeds by using stochastic gradient descent, and the gradients are estimated with sampling methods. However, the gradient estimation is a ...

2015

Point localization in Multi-camera system

Alireza Ghasemi, Soumyabrata Dev

Point localization in multi-camera setups has been widely studied in computer vision. Recently, in finite-resolution camera settings, a consistent and optimal point localization algorithm called SHARP has been proposed, under the assumption of noiseless ca ...

pas d'éditeur2015