Delves into the fundamental limits of gradient-based learning on neural networks, covering topics such as binomial theorem, exponential series, and moment-generating functions.
Introduces Bayesian estimation, covering classical versus Bayesian inference, conjugate priors, MCMC methods, and practical examples like temperature estimation and choice modeling.