Explores kernels for simplifying data representation and making it linearly separable in feature spaces, including popular functions and practical exercises.
Covers optimization in machine learning, focusing on gradient descent for linear and logistic regression, stochastic gradient descent, and practical considerations.