Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers decision trees for classification, supervised learning, measuring the quality of a split using entropy, and the concept of information gain. It also includes a demonstration in Tableau & KNIME, one-hot encoding, hyperparameter optimization, and the use of random forests. The instructor explains the process of optimizing hyperparameters, one-hot encoding with scikit-learn and pandas, and the importance of choosing the right model. The lecture concludes with an overview of cross-validation, model evaluation, and the sklearn.tree.DecisionTreeClassifier parameters.