Module 3 – Machine Learning

Objectives

  • To learn the terminology associated with machine learning (e.g., train-test splits, cross-validation)
  • To learn how to execute an end-to-end machine learning pipeline using the scikit-learn library
  • To learn various techniques for inspecting how a machine learning model makes its decisions

IMPORTANT: This module focuses on “traditional” machine learning models like logistic regressors and random forest classifiers. It does not cover state-of-the-art deep learning like large language models (LLMs) or foundational models. However, the last lecture will show how the concepts covered in this module are prerequisite for these advanced model architectures.

Session Title Materials Exercises
Lecture 1: Machine Learning Terminology and Workflow
  • Crash Course on Machine Learning ( slides )
Lecture 2: Our First Models
  • Classification with an Image Dataset: Skin Cancer ( colab )
  • Regression with a Time-Series Dataset: Parkinsonian Gait ( colab )
Lecture 2 HW ( colab )
Lecture 3: Refining Our Models
  • Refining Our Image Classifier ( colab )
  • Refining Our Time-Series Regressor ( colab )
Lecture 3 HW ( colab )
Lecture 4: Interpretability and Bias
  • Motivation ( colab )
  • Naïve Baselines ( colab )
  • Feature Importance ( colab )
  • SHAP Values ( colab )
  • Quantifying Bias ( colab )
Lecture 4 HW ( colab )
Lecture 5: Flex Day
  • Primer on Image CNNs ( colab )