From Models to Experiments

Week 7: CS 203 - Software Tools and Techniques for AI

Part 1: Refresher

The Story So Far

The Complexity Ladder

Bias-Variance Tradeoff

Overfitting vs Underfitting

Diagnosing Your Model

Regularization: One Slide

Part 2: Cross-Validation

The Problem

Why Does This Happen?

K-Fold Cross-Validation: The Fix

K-Fold: How It Works

K-Fold in Code

How to Report Results

Choosing K

Stratified K-Fold

When Standard K-Fold Breaks

Data Leakage: The #1 CV Mistake

Other Common Leakage Sources

Learning Curves

Validation Curves

Cross-Validation Summary

Part 3: Hyperparameter Tuning

Parameters vs Hyperparameters

Common Hyperparameters

Approach 0: "Grad Student Descent"

Approach 1: Grid Search

Grid Search: The Explosion Problem

Approach 2: Random Search

Why Random Search Works Better

Random Search in Code

Grid vs Random: When to Use Which

Approach 3: Bayesian Optimization

Optuna: Bayesian Optimization Made Easy

Optuna: Built-In Visualizations

Comparison: All Three Approaches

The Tuning Trap: Evaluating Tuned Models

Nested Cross-Validation

Nested CV in Code

Hyperparameter Tuning: Best Practices

Common Tuning Mistakes

Part 4: Experiment Tracking

The Notebook Graveyard

What to Track

Part 5: AutoML

The Manual Process We Just Learned

What AutoML Does

AutoGluon: 3 Lines of Code

What Happens Inside

AutoGluon Leaderboard

AutoGluon Presets

When to Use AutoML

AutoML vs Manual: The Spectrum

The Complete Workflow

Key Takeaways & Exam Prep

Key Takeaways

Exam Questions

More Exam Questions

Lab Preview

Questions?