How to change the policy with a gradient method

Description

This lecture covers how to optimize the total expected reward by directly associating stimuli with actions and adjusting the policy with gradient methods. It explains, with examples, how to change the policy based on neural responses so as to maximize the total reward.

Official source

https://mediaspace.epfl.ch/media/0_a0q5qzf9

Related lectures (32)

Optimization of Neuroprosthetic Systems

Explores the optimization of neuroprosthetic systems, including sensory feedback restoration and neural stimulation strategies.

Data-Driven Modeling in Neuroscience: Meenakshi Khosla

Meenakshi Khosla explores data-driven modeling in large-scale naturalistic neuroscience, focusing on brain activity representation and computational models.

Neural Networks: Hierarchical Models and Odor Taxis

Covers neural function, hierarchical models, odor taxis behaviors, and disparate circuit parameters in 18 slides.

Engineering Neurons: Light, Chemicals, Sound

Explores optogenetics, chemogenetics, and sonogenetics to engineer neural activity using light, chemicals, and sound.

Neural Signals and Signal Processing

Explores neuronal signals, brain organization, measurement techniques, and MRI principles.