This lecture covers stochastic gradient descent (SGD) and its application to non-convex optimization. It begins with an introduction to SGD, explaining its efficiency on sum-structured objectives, where the cost function is a sum (or average) of per-observation losses. The instructor details the algorithm and emphasizes the key benefit of stochastic gradients over full gradients: each step evaluates the gradient of only a single observation, which dramatically reduces the per-iteration computational cost.

The lecture then examines the unbiasedness of stochastic gradients — in expectation, the stochastic gradient equals the full gradient — and presents theorems on convergence rates under various conditions, including bounded stochastic gradients and strong convexity. The discussion extends to mini-batch SGD, highlighting its advantages for variance reduction and parallelization.

Finally, the lecture addresses the challenges of non-convex optimization, such as local minima and saddle points, and introduces smooth functions and bounded Hessians as tools for analyzing this setting. The instructor closes by discussing the implications of these techniques for machine learning, providing a comprehensive picture of optimization strategies in complex scenarios.
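The sum-structured setting and the single-sample versus mini-batch variants described above can be sketched as follows. This is a minimal illustration, not the lecture's own code: it assumes a synthetic least-squares objective f(w) = (1/n) Σᵢ (xᵢᵀw − yᵢ)², and the function name `sgd` and all parameter choices (learning rate, batch size, epoch count) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic sum-structured objective: f(w) = (1/n) * sum_i (x_i^T w - y_i)^2
# (hypothetical data, chosen only to illustrate the setting)
n, d = 1000, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def sgd(X, y, lr=0.01, batch_size=1, epochs=50):
    """Mini-batch SGD; batch_size=1 recovers plain single-sample SGD."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        order = rng.permutation(n)  # reshuffle each epoch
        for start in range(0, n, batch_size):
            batch = order[start:start + batch_size]
            # Stochastic gradient of the sampled losses: an unbiased
            # estimate of the full gradient, at a fraction of its cost.
            g = 2 * X[batch].T @ (X[batch] @ w - y[batch]) / len(batch)
            w -= lr * g
    return w

w_single = sgd(X, y, batch_size=1)   # one observation per step
w_batch = sgd(X, y, batch_size=32)   # mini-batch: lower-variance steps
```

Both variants take the same number of passes over the data; the mini-batch version averages the per-example gradients, which reduces the variance of each step and allows the per-batch work to be parallelized.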