Model Selection & Evaluation

Why Models Fail and How to Fix It

Nipun Batra | IIT Gandhinagar

The Story So Far

Lecture What We Learned
2 Data, features, train/test split
3 Linear & Logistic Regression
4 Why models fail & how to evaluate them

The Problem We're Solving

You trained a model. It looks great on your data.

But will it work on NEW data?

On Training Data On New Data Status
95% accuracy 50% accuracy 😱 Problem!
85% accuracy 83% accuracy Good!

Today's Questions

  1. Why do some models fail on new data?
  2. How do we detect this problem?
  3. How do we choose between different models?

Part 1: The Two Ways Models Fail

Underfitting & Overfitting

A Simple Example

Task: Predict house prices from size

You have 10 houses with prices. You fit a model.

Question: Which model is best?

Model Strategy Training Error
1 Predict average for all High
2 Fit a line Medium
3 Memorize each point Zero!

Model 1: Too Simple (Underfitting)

Model: Average price for all houses = ₹50 lakhs

House Size Actual Price Prediction
500 sq ft ₹30 lakhs ₹50 lakhs
1000 sq ft ₹45 lakhs ₹50 lakhs
2000 sq ft ₹80 lakhs ₹50 lakhs

Problem: The model ignores the pattern completely!

Model 2: Too Complex (Overfitting)

Model: Memorizes every house exactly

House Size Actual Price Prediction
500 sq ft ₹30 lakhs ₹30 lakhs
1000 sq ft ₹45 lakhs ₹45 lakhs
2000 sq ft ₹80 lakhs ₹80 lakhs

Perfect on training data!

But for a NEW house of 1500 sq ft? → Crazy prediction!

Model 3: Just Right

Model: Price = ₹20 lakhs + ₹3,000 per sq ft

House Size Actual Price Prediction
500 sq ft ₹30 lakhs ₹35 lakhs
1000 sq ft ₹45 lakhs ₹50 lakhs
2000 sq ft ₹80 lakhs ₹80 lakhs

Not perfect on training, but captures the pattern!

For a NEW house of 1500 sq ft? → ₹65 lakhs (reasonable!)
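In code, Model 3 is a one-line function. A quick sketch reproducing the table above (the slope of ₹3,000 per sq ft equals 0.03 lakh per sq ft):

```python
def predict_price_lakhs(size_sqft):
    """Model 3: base ₹20 lakhs plus ₹3,000 (0.03 lakh) per sq ft."""
    return 20 + size_sqft * 3 / 100  # slope expressed in lakhs per sq ft

print(predict_price_lakhs(500))   # 35.0
print(predict_price_lakhs(1000))  # 50.0
print(predict_price_lakhs(2000))  # 80.0, matches the actual price exactly
print(predict_price_lakhs(1500))  # 65.0, the new-house prediction
```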

The Key Insight

Model Type Training Error Test Error Problem
Underfitting High High Too simple
Overfitting Low HIGH Too complex
Good fit Low Low Just right!

Overfitting = Memorizing instead of Learning

Visual: Polynomial Fitting Example

What Controls Complexity?

Factor Less Complex More Complex
Polynomial degree Degree 1 (line) Degree 10 (wiggly)
Tree depth Depth 2 Depth 20
Number of features 3 features 100 features
Neural network size 10 neurons 10,000 neurons

More complexity = More risk of overfitting!
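The first row of the table is easy to demonstrate. A minimal sketch on synthetic data (the dataset, noise level, and degrees are illustrative): a degree-9 polynomial drives training error to nearly zero by interpolating all 10 points, but does worse on new points between them.

```python
import numpy as np

rng = np.random.default_rng(0)
x_train = np.linspace(0, 1, 10)
y_train = 2 * x_train + rng.normal(0, 0.1, 10)  # true pattern: a line plus noise
x_test = np.linspace(0.05, 0.95, 50)            # new points between training ones
y_test = 2 * x_test

errors = {}
for degree in [1, 9]:
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    errors[degree] = (train_mse, test_mse)
    print(f"degree {degree}: train MSE {train_mse:.5f}, test MSE {test_mse:.5f}")

# Degree 9 passes through all 10 training points, then wiggles in between.
```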

Underfitting: Like a Student Who Skipped Class

Symptoms:

  • Bad on training data
  • Bad on test data
  • Model is too simple to capture the pattern

Solutions:

  • Use more features
  • Use a more complex model
  • Train longer

Overfitting: Like a Student Who Memorized Without Understanding

Symptoms:

  • Great on training data
  • Bad on test data
  • Model memorized the training examples

Solutions:

  • Get more training data
  • Use a simpler model
  • Use regularization

Analogy: Studying for an Exam

Think of it like preparing for a test:

  • Underfitting: Didn't study → Failed homework, failed exam
  • Overfitting: Memorized answers → Perfect homework, failed exam
  • Good fit: Understood concepts → Good on both!

We want models that understand patterns, not memorize examples.

Another Analogy: Fitting Clothes

Fit Description Problem
Underfitting Clothes way too big Doesn't capture your shape
Overfitting Clothes skin-tight Perfect for NOW, can't move
Good fit Comfortable fit Works in many situations

A model should "fit" the data well enough to be useful, but not so tight it can't generalize!

More Everyday Analogies

Scenario Underfitting Overfitting Good Fit
Learning to drive "Just press pedals" Memorized one route Understands driving rules
Learning recipes "Just add heat" Memorized exact measurements Understands cooking principles
Learning language "Hello" and "Goodbye" only Memorized phrases Understands grammar

Good models generalize to new situations!

Signs You're Overfitting

Watch out for these warning signs:

Warning Sign What It Means
Training accuracy 99%, test accuracy 70% Classic overfitting
Model performs great on your data, fails on new data Didn't generalize
Adding more features hurts test performance Too complex
Small changes in data cause big changes in predictions Unstable model

Signs You're Underfitting

Warning Sign What It Means
Both training and test accuracy are low Model too simple
Model gives similar predictions for very different inputs Not learning patterns
Increasing training time doesn't help Need more capacity
Residuals show clear patterns Missing important features
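The last row, patterned residuals, can be checked numerically. A sketch on synthetic data (the quadratic relationship is an assumption for illustration): a line fit to curved data leaves residuals that correlate strongly with the missing feature.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(42)
x = np.linspace(-3, 3, 200).reshape(-1, 1)
y = x.ravel() ** 2 + rng.normal(0, 0.3, 200)   # true relationship is quadratic

model = LinearRegression().fit(x, y)           # a line cannot capture the curve
residuals = y - model.predict(x)

# If the model had captured the pattern, residuals would look like pure noise.
corr = np.corrcoef(residuals, x.ravel() ** 2)[0, 1]
print(f"correlation of residuals with x^2: {corr:.2f}")
```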

Real-World Overfitting: COVID X-Ray Detection

What happened: Model achieved 90%+ accuracy but learned to read hospital IDs in corner of X-rays, not lung features! COVID patients came from specific hospitals.

Shortcut learning: Model finds the easiest pattern, not the right one!

How Much Data is Enough?

Rough guidelines:

Model Complexity Data Needed
Linear Regression 10 samples per feature
Decision Tree 100+ samples per class
Neural Network 1000+ samples per class
Deep Learning 10,000+ samples total

Rule of thumb: More parameters = More data needed

Visual: The Complexity Tradeoff

The Bias-Variance Tradeoff

Term Meaning Problem
Bias Model's assumptions are too simple Underfitting
Variance Model is too sensitive to training data Overfitting

Total Error = Bias² + Variance + Noise

Model Bias Variance Result
Too simple High Low Underfitting
Too complex Low High Overfitting
Just right Low Low Good! ✅
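The table can be backed by a small simulation. A sketch under illustrative assumptions (a sine curve, noise level 0.3, degrees 1 and 9): refit each model on many fresh noisy training sets and decompose the error at one test point.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 20)
true_curve = np.sin(2 * np.pi * x)
x0, y0 = 0.25, np.sin(2 * np.pi * 0.25)        # single test point, true value 1

results = {}
for degree in [1, 9]:
    preds = []
    for _ in range(200):                        # many independent training draws
        y = true_curve + rng.normal(0, 0.3, x.size)
        preds.append(np.polyval(np.polyfit(x, y, degree), x0))
    preds = np.array(preds)
    bias_sq = (preds.mean() - y0) ** 2          # systematic miss (too simple)
    variance = preds.var()                      # sensitivity to the training draw
    results[degree] = (bias_sq, variance)
    print(f"degree {degree}: bias^2 {bias_sq:.4f}, variance {variance:.4f}")

# Degree 1: high bias, low variance (underfits).
# Degree 9: low bias, higher variance (overfits).
```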

The Sweet Spot

Goal: Find the model complexity that minimizes TOTAL error

If you're underfitting... If you're overfitting...
Increase complexity Decrease complexity
Add more features Remove features
Use deeper trees Use shallower trees
Train longer Stop earlier
Use less regularization Use more regularization

Part 2: Detecting the Problem

Train, Validation, and Test Sets

Why We Need Multiple Sets

Scenario: You train a model and evaluate it on the same data.

model.fit(X, y)           # Train on all data
accuracy = model.score(X, y)  # Test on same data
print(f"Accuracy: {accuracy:.0%}")  # 99% !!!

Problem: Of course it's high - the model saw these examples!

The Solution: Hold Out Test Data

from sklearn.model_selection import train_test_split

# Split: 80% train, 20% test
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Train on training set only
model.fit(X_train, y_train)

# Evaluate on unseen test set
test_accuracy = model.score(X_test, y_test)  # Honest evaluation!

Train vs Test Error

train_accuracy = model.score(X_train, y_train)  # 95%
test_accuracy = model.score(X_test, y_test)     # 85%

Comparison What It Means
Train ≈ Test Good! Model generalizes
Train >> Test Overfitting! Model memorized
Train and Test both low Underfitting! Model too simple

The Gap Tells You Everything

gap = train_accuracy - test_accuracy

Gap Diagnosis
< 5% Good generalization ✅
5-15% Some overfitting ⚠️
> 15% Severe overfitting ❌
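The table maps directly onto a tiny helper. A minimal sketch (the thresholds are the rough rules of thumb above, not universal constants):

```python
def diagnose_gap(train_acc, test_acc):
    """Classify the train-test gap using the rough thresholds above."""
    gap = train_acc - test_acc
    if gap > 0.15:
        return "severe overfitting"
    if gap > 0.05:
        return "some overfitting"
    return "good generalization"

print(diagnose_gap(0.95, 0.85))  # some overfitting
print(diagnose_gap(0.99, 0.60))  # severe overfitting
print(diagnose_gap(0.86, 0.84))  # good generalization
```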

What About Model Selection?

Problem: You want to try different models and pick the best.

If you use the test set to choose... you're peeking!

# Don't do this!
for model in [model1, model2, model3]:
    model.fit(X_train, y_train)
    score = model.score(X_test, y_test)  # Using test to choose!

Three-Way Split

Solution: Add a validation set for model selection.

The Three Sets

Set Purpose When to Use
Training Learn model weights During training
Validation Compare models, tune settings During development
Test Final honest evaluation Only at the very end!

Golden rule: Never touch the test set until you're completely done!

In Code

from sklearn.model_selection import train_test_split

# First split: separate test set
X_temp, X_test, y_temp, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Second split: train and validation
X_train, X_val, y_train, y_val = train_test_split(
    X_temp, y_temp, test_size=0.25, random_state=42
)

# Now: 60% train, 20% val, 20% test

Using the Three Sets

# 1. Try different models on validation set
for model in models:
    model.fit(X_train, y_train)
    val_score = model.score(X_val, y_val)
    print(f"{model}: {val_score:.2%}")

# 2. Pick the best model
best_model = ...  # Based on validation scores

# 3. Final evaluation on test set (ONCE!)
final_score = best_model.score(X_test, y_test)
print(f"Final test score: {final_score:.2%}")

Part 3: Cross-Validation

Getting Reliable Performance Estimates

Why Do We Need Cross-Validation?

Problem: A single train/test split can be lucky or unlucky!

for seed in [1, 2, 3]:
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=seed)
    score = model.fit(X_tr, y_tr).score(X_te, y_te)  # 87%, 92%, 84%

Which is the true score? We don't know!

The Problem with Single Splits

Issue What Happens
Small test set Score varies wildly depending on which examples end up in test
Unlucky split A good model looks bad because hard examples are in test
Lucky split A bad model looks good because easy examples are in test
Wasted data 20% of data never used for training

We need a more reliable way to estimate performance!

K-Fold Cross-Validation: The Solution

Key insight: Every data point gets to be in the test set exactly once!

K-Fold: A Concrete Example

You have 100 samples, K=5 folds:

Fold Train on Test on Score
1 Samples 21-100 Samples 1-20 87%
2 Samples 1-20, 41-100 Samples 21-40 89%
3 Samples 1-40, 61-100 Samples 41-60 91%
4 Samples 1-60, 81-100 Samples 61-80 88%
5 Samples 1-80 Samples 81-100 90%

Final score: average of the five folds = (87 + 89 + 91 + 88 + 90) / 5 = 89%
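The table above corresponds to a loop you can write by hand before reaching for cross_val_score. A sketch on a synthetic stand-in dataset (make_classification and the model choice are illustrative):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold

X, y = make_classification(n_samples=100, random_state=0)  # stand-in for real data
kf = KFold(n_splits=5, shuffle=True, random_state=0)

scores = []
for fold, (train_idx, test_idx) in enumerate(kf.split(X), start=1):
    model = LogisticRegression(max_iter=1000)
    model.fit(X[train_idx], y[train_idx])                  # train on 4 folds
    scores.append(model.score(X[test_idx], y[test_idx]))   # test on held-out fold
    print(f"Fold {fold}: {scores[-1]:.2f}")

print(f"Final score: {np.mean(scores):.2f} ± {np.std(scores):.2f}")
```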

Why K-Fold Works

Single Split K-Fold CV
Uses 80% for training Uses 100% of data (across all folds)
One score (could be lucky) K scores → average is more reliable
High variance Low variance
Can't detect unstable models Standard deviation shows stability

Example: Score = 89% ± 1.5% tells us much more than just "89%"

K-Fold in sklearn

from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LogisticRegression

model = LogisticRegression()

# 5-fold cross-validation
scores = cross_val_score(model, X, y, cv=5)

print(f"Scores per fold: {scores}")
# [0.87, 0.89, 0.91, 0.88, 0.90]

print(f"Mean: {scores.mean():.3f} ± {scores.std():.3f}")
# Mean: 0.890 ± 0.015

Choosing K: Trade-offs

K Name Pros Cons
5 5-Fold Fast, good default Slightly higher variance
10 10-Fold More reliable estimate 2x slower
n Leave-One-Out Uses maximum data Very slow, high variance

Rule of thumb: Use K=5 for quick experiments, K=10 for final evaluation.

But Wait... What About Hyperparameters?

New problem: We want to tune hyperparameters (like regularization C)

# Which C is best?
for C in [0.01, 0.1, 1, 10, 100]:
    model = LogisticRegression(C=C)
    score = cross_val_score(model, X, y, cv=5).mean()
    print(f"C={C}: {score}")

Danger: We used the test folds to CHOOSE the best C!

Now our "test" score is biased — we've leaked information!

The Data Leakage Problem

What went wrong:

  1. We evaluated C=0.01 on folds → got a score
  2. We evaluated C=0.1 on folds → got a score
  3. We picked the C with best score
  4. We reported that score as "test accuracy"

But that score was used to MAKE A DECISION!

It's like a student seeing the test before the exam.

Data Leakage: A Hiring Analogy

Imagine you're hiring:

Proper Process Leaky Process
Screen resumes blindly See interview performance first
Interview candidates Then "predict" who'll do well
Measure: "How good am I at predicting?" Cheating! You already know!

In ML:

  • Test set = the "real interview"
  • Using test set to choose model = seeing answers first
  • Your "accuracy" is now meaningless

If you make ANY decision based on test data, your evaluation is biased!

Nested Cross-Validation: The Fix

How Nested CV Works

Loop What It Does Uses
Outer loop Gives honest test score Test fold (never touched during tuning)
Inner loop Finds best hyperparameters Train + Val folds only

Process for each outer fold:

  1. Hold out test fold (don't touch it!)
  2. Run inner CV on remaining data to find best hyperparameters
  3. Train final model with best hyperparameters
  4. Evaluate on test fold → one honest score

Average all outer fold scores → reliable estimate!

Nested CV in sklearn

from sklearn.model_selection import cross_val_score, GridSearchCV
from sklearn.linear_model import LogisticRegression

# Inner loop: find best hyperparameters using 3-fold CV
param_grid = {'C': [0.01, 0.1, 1, 10, 100]}
inner_cv = GridSearchCV(LogisticRegression(), param_grid, cv=3)

# Outer loop: honest evaluation using 5-fold CV
scores = cross_val_score(inner_cv, X, y, cv=5)

print(f"Nested CV score: {scores.mean():.3f} ± {scores.std():.3f}")
# This score is honest — no data leakage!

Summary: When to Use What

Situation Method Why
Evaluate a fixed model K-Fold CV Reliable score, no tuning needed
Tune hyperparameters + evaluate Nested CV No data leakage
Compare two models (no tuning) K-Fold CV Compare mean ± std
Final deployment Retrain on ALL data Use every example

Golden rule: Never use the same data to both CHOOSE and EVALUATE your model!

Part 4: Practical Guidelines

Making Good Model Choices

The Model Selection Workflow

Step-by-Step: Model Selection

Step Action Tools
1. Split Separate test set (don't touch!) train_test_split
2. Explore Try different model types LogisticRegression, DecisionTree, etc.
3. Compare Use cross-validation cross_val_score
4. Tune Find best hyperparameters GridSearchCV
5. Select Pick best model Look at mean ± std
6. Final Test Report honest score Evaluate on test set ONCE

Common Mistakes to Avoid

Mistake Why It's Bad Fix
Testing on training data Overly optimistic scores Always use separate test set
Tuning on test set Leaks information Use validation set for tuning
Picking model by test score Test set becomes validation Use cross-validation
Reporting validation score as final Not an honest estimate Report test score
Testing multiple times "Overfitting" to test set Test only ONCE

Common Hyperparameters

Hyperparameters = Settings YOU choose before training

Model Hyperparameter What It Does
Linear Regression - None (that's its beauty!)
Logistic Regression C Controls regularization
Decision Tree max_depth Limits tree complexity
Neural Network Learning rate How fast to learn

Simple Hyperparameter Tuning

# Try different max_depth values
for depth in [2, 3, 5, 10, None]:
    model = DecisionTreeClassifier(max_depth=depth)
    scores = cross_val_score(model, X, y, cv=5)
    print(f"depth={depth}: {scores.mean():.3f}")

depth=2:    0.78  ← Underfitting
depth=3:    0.85
depth=5:    0.88  ← Sweet spot
depth=10:   0.85
depth=None: 0.75  ← Overfitting

What to Report

When presenting your model, always report:

Metric Why
Training accuracy Shows if model learns
Validation accuracy Shows if model generalizes
Test accuracy The honest final score
Standard deviation Shows reliability

Red Flags to Watch For

Observation Problem Solution
Train = 99%, Test = 60% Overfitting Simpler model, more data
Train = 55%, Test = 50% Underfitting Complex model, more features
Huge variance in CV scores Unstable model More data, simpler model
Test score much better than val Data leakage! Check your pipeline

Summary: The Key Ideas

  1. Overfitting = memorizing, not learning

    • High training accuracy, low test accuracy
  2. Underfitting = too simple to learn

    • Low training AND test accuracy
  3. Train/Validation/Test split is essential

    • Never use test set for model selection!
  4. Cross-validation gives reliable estimates

    • Use cross_val_score in sklearn

Regularization: Preventing Overfitting

What is Regularization?

Regularization = Add a penalty for complexity

λ (lambda) Effect
λ = 0 No regularization → may overfit
λ small Light penalty → slight smoothing
λ large Heavy penalty → may underfit

λ is a hyperparameter you tune!

Regularization Intuition

Think of it like a budget constraint:

Without Regularization With Regularization
"Use as many weights as you want!" "Each weight costs you!"
Model goes wild, memorizes Model stays simple, generalizes

Analogy: Writing an essay

  • No limit → rambling, covers every detail
  • Word limit → focused, captures key points

Regularization forces the model to be efficient with its parameters!
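The "budget" intuition is easy to see numerically. A sketch on synthetic data (the shapes and alpha values are illustrative): as the penalty strength alpha grows, Ridge spends less total weight budget.

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(30, 10))                  # few samples, many features
y = X[:, 0] + rng.normal(0, 0.5, size=30)      # only feature 0 truly matters

budget = {}
for alpha in [0.01, 1.0, 100.0]:
    model = Ridge(alpha=alpha).fit(X, y)
    budget[alpha] = float(np.sum(model.coef_ ** 2))  # total squared-weight "spend"
    print(f"alpha={alpha}: sum of squared weights = {budget[alpha]:.3f}")

# Heavier penalty (larger alpha) → smaller weights → simpler, smoother model.
```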

Types of Regularization

Type Penalty Added Effect
Ridge (L2) λ Σ wⱼ² Shrinks all weights toward zero
Lasso (L1) λ Σ |wⱼ| Makes some weights exactly zero
Elastic Net λ₁ Σ |wⱼ| + λ₂ Σ wⱼ² (both L1 + L2) Combines benefits

from sklearn.linear_model import Ridge, Lasso

model = Ridge(alpha=1.0)   # L2: all features kept, smaller weights
model = Lasso(alpha=0.1)   # L1: some features removed entirely

When to Use Regularization?

Situation Recommendation
Many features, little data Strong regularization
Few features, lots of data Light or no regularization
Features highly correlated Ridge (L2) works better
Want feature selection Lasso (L1)
Neural networks Use Dropout or L2

Almost always use some regularization — it rarely hurts!

Learning Curves: Diagnosing Problems

Reading Learning Curves

Pattern (train/validation error) Diagnosis Solution
Both errors high, converge Underfitting More features, complex model
Train error low, val error high, big gap Overfitting More data, regularization
Both errors low, converge Good fit! You're done

from sklearn.model_selection import learning_curve

train_sizes, train_scores, val_scores = learning_curve(
    model, X, y, cv=5, train_sizes=[0.2, 0.4, 0.6, 0.8, 1.0]
)
# Average over the CV folds to get one curve per set
train_mean = train_scores.mean(axis=1)
val_mean = val_scores.mean(axis=1)

Grid Search: Automated Tuning

Instead of manually trying hyperparameters:

from sklearn.model_selection import GridSearchCV

param_grid = {
    'max_depth': [3, 5, 10, None],
    'min_samples_split': [2, 5, 10]
}

grid_search = GridSearchCV(
    DecisionTreeClassifier(),
    param_grid,
    cv=5,
    scoring='accuracy'
)
grid_search.fit(X_train, y_train)

print(f"Best params: {grid_search.best_params_}")
print(f"Best score: {grid_search.best_score_:.3f}")

What We Skipped (Advanced Topics)

These are important but more advanced:

Topic What It Is
Bias-Variance Tradeoff Mathematical view of underfitting/overfitting
Ensemble Methods Combining multiple models (Random Forest, etc.)
Bayesian Optimization Smarter hyperparameter search
Early Stopping Stop training when validation error increases

You'll learn these in advanced ML courses!

Code Summary

# Split data
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Cross-validation comparison
from sklearn.model_selection import cross_val_score

scores = cross_val_score(model, X_train, y_train, cv=5)
print(f"CV Score: {scores.mean():.3f} ± {scores.std():.3f}")

# Final evaluation (only once!)
final_score = model.score(X_test, y_test)

PyTorch: Data Splitting

from torch.utils.data import random_split, DataLoader

# Assume dataset has 1000 samples
dataset = MyDataset(...)

# Split: 60% train, 20% val, 20% test
train_size = int(0.6 * len(dataset))
val_size = int(0.2 * len(dataset))
test_size = len(dataset) - train_size - val_size

train_set, val_set, test_set = random_split(
    dataset, [train_size, val_size, test_size]
)

PyTorch: DataLoaders

# Create DataLoaders for batching
train_loader = DataLoader(train_set, batch_size=32, shuffle=True)
val_loader = DataLoader(val_set, batch_size=32, shuffle=False)
test_loader = DataLoader(test_set, batch_size=32, shuffle=False)

# Training loop
for epoch in range(num_epochs):
    for X_batch, y_batch in train_loader:
        # Train on batch
        ...

    # Validate after each epoch
    for X_batch, y_batch in val_loader:
        # Compute validation loss
        ...

PyTorch: K-Fold Cross-Validation

from sklearn.model_selection import KFold
from torch.utils.data import DataLoader, Subset

kfold = KFold(n_splits=5, shuffle=True)

for fold, (train_idx, val_idx) in enumerate(kfold.split(dataset)):
    train_subset = Subset(dataset, train_idx)
    val_subset = Subset(dataset, val_idx)

    train_loader = DataLoader(train_subset, batch_size=32)
    val_loader = DataLoader(val_subset, batch_size=32)

    # Train and evaluate for this fold
    score = train_and_evaluate(model, train_loader, val_loader)
    print(f"Fold {fold}: {score:.3f}")

Key Takeaways

Concept Remember This
Overfitting Train ✅ Test ❌ → Too complex
Underfitting Train ❌ Test ❌ → Too simple
Validation set For tuning hyperparameters
Test set Touch only ONCE at the end
Cross-validation K-fold for reliable scores
Regularization Prevents overfitting

Questions?

Next Lecture: Neural Networks

From linear models to deep learning!