This lecture delves into the relationship between neural learning rules and the derivative of the logarithm of a policy, exploring the post-synaptic and pre-synaptic contributions in weight updates. It explains how rewards modulate synaptic weights and how the activity of neurons influences learning.