Neural networks under SGDExplores the optimization of neural networks using Stochastic Gradient Descent (SGD) and the concept of dual risk versus empirical risk.
Information Measures: Part 2Covers information measures like entropy, joint entropy, and mutual information in information theory and data processing.