Problem	Symptom	Solution
The Machine	"It works on my laptop but not yours"	Docker
The Math	"I get different results every run"	Seeds & determinism
The Memory	"Which of my 50 runs was the good one?"	Experiment tracking (TrackIO)

Without Docker	With Docker
45 mins of `pip install` errors	`docker run spam-app` (2 min)
"Which sklearn version?"	Frozen inside the container
"Are you on Windows?"	Doesn't matter

Concept	What It Is	Analogy
Image	Blueprint — OS + libraries + code (read-only)	Recipe card
Container	Running instance of an image	Dish being cooked
Dockerfile	Instructions to build an image	The recipe
Docker Hub	Registry of pre-built images	GitHub for images
Volumes	Shared folder between container and laptop	USB drive
Ports	How you access container's web apps	Window into the kitchen

	VM	Docker Container
Size	GBs (full OS)	MBs (just your app)
Startup	Minutes	Seconds
Isolation	Complete (own kernel)	Process-level (shared kernel)
Use case	Run a different OS	Package an app with dependencies
RAM	Each VM needs GBs	Containers share host memory

Image	What's Inside	Size
`python:3.10`	Full Debian + Python + compilers	~1 GB
`python:3.10-slim`	Minimal Debian + Python	~150 MB
`python:3.10-alpine`	Alpine Linux + Python	~50 MB
`ubuntu:22.04`	Just Ubuntu, no Python	~77 MB

Common Docker Errors

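1. docker: command not found
→ Docker isn't installed, or the docker CLI isn't on your PATH. Install Docker Desktop and open a new terminal. (If you see "Cannot connect to the Docker daemon" instead, Docker Desktop isn't running. Open the app first!)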

2. port is already allocated
→ Something else is using that host port. Change the left-hand (host) number and keep the container port: -p 7862:7860

3. COPY failed: file not found
→ The file isn't inside the build context (the folder you pass to docker build, usually the one holding your Dockerfile), or .dockerignore excludes it. Check with ls.

4. Image is 3GB!
→ Use FROM python:3.10-slim instead of python:3.10

docker system prune   # remove stopped containers, unused networks, dangling images, and build cache (add -a to remove all unused images)
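
For error 2, it can help to find a host port that is actually free before choosing a new -p mapping. The sketch below is one way to check in plain Python. `port_is_free` and `first_free_port` are hypothetical helper names (not Docker commands), and binding to 0.0.0.0 is an assumption meant to mirror how Docker publishes ports on all interfaces by default.

```python
import socket


def port_is_free(port, host="0.0.0.0"):
    """Return True if nothing else is bound to (host, port) right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        try:
            s.bind((host, port))   # fails with OSError if the port is taken
            return True
        except OSError:
            return False


def first_free_port(start=7860, tries=20):
    """Scan upward from `start` and return the first free host port."""
    for port in range(start, start + tries):
        if port_is_free(port):
            return port
    raise RuntimeError(f"no free port in {start}..{start + tries - 1}")


if __name__ == "__main__":
    host_port = first_free_port()
    # Container port 7860 and image name spam-app come from the slides above.
    print(f"docker run -p {host_port}:7860 spam-app")
```

The check is only a snapshot: another process could grab the port between the check and `docker run`, so treat the result as a suggestion rather than a guarantee.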

Minecraft	Machine Learning
Seed number	`random_state=42`
Same seed → same world	Same seed → same split, same model
Share seed with friend → same map	Share seed with TA → same results
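
The "same seed → same split" row can be tried without any ML library. This is a minimal sketch using Python's standard `random` module as a stand-in for scikit-learn's `random_state`; `split_indices` is a hypothetical helper written for illustration, not a library function.

```python
import random


def split_indices(n_samples, test_fraction=0.2, seed=42):
    """Shuffle indices 0..n_samples-1 with a seeded RNG, then split them."""
    rng = random.Random(seed)      # private generator; global RNG state untouched
    indices = list(range(n_samples))
    rng.shuffle(indices)
    n_test = int(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]   # (train, test)


train_a, test_a = split_indices(10, seed=42)
train_b, test_b = split_indices(10, seed=42)
train_c, test_c = split_indices(10, seed=7)

print(test_a == test_b)   # same seed gives the same split
print(test_a, test_c)     # a different seed will usually give a different split
```

Passing the seed in explicitly, rather than relying on hidden global state, is what makes it shareable: anyone who runs the same code with the same seed (and the same Python version) gets the same split.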

Tab	What You See
Metrics	Training curves, overlaid runs
Media & Tables	Images, per-class tables
Runs	All configs and final metrics
System Metrics	GPU/CPU usage (auto on Apple Silicon)
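
The idea behind the Runs tab (every run's config and final metrics kept in one place) can be sketched without a tracking library. TrackIO's own API is not shown in this section, so the example below deliberately uses a plain JSON-lines file as a stand-in. `log_run` and `best_run` are hypothetical helpers, and the configs and accuracy values are made up for illustration.

```python
import json
import tempfile
from pathlib import Path


def log_run(log_path, config, metrics):
    """Append one run (its config and final metrics) as a single JSON line."""
    with open(log_path, "a", encoding="utf-8") as f:
        f.write(json.dumps({"config": config, "metrics": metrics}) + "\n")


def best_run(log_path, metric="accuracy"):
    """Return the logged run with the highest value of `metric`."""
    with open(log_path, encoding="utf-8") as f:
        runs = [json.loads(line) for line in f if line.strip()]
    return max(runs, key=lambda run: run["metrics"][metric])


# Illustrative values only; a real script would log its actual results.
log_path = Path(tempfile.mkdtemp()) / "runs.jsonl"
log_run(log_path, {"lr": 0.1, "seed": 42}, {"accuracy": 0.85})
log_run(log_path, {"lr": 0.01, "seed": 42}, {"accuracy": 0.92})

print(best_run(log_path)["config"])   # {'lr': 0.01, 'seed': 42}
```

A dedicated tracker adds what this file cannot: curves over time, overlaid runs, and a dashboard. The habit it enforces is the same, though: log the config next to the metric at the moment the run finishes.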

Layer	Tool	What It Controls
The Machine	Docker	OS + system libs + everything
The Packages	`venv` + `requirements.txt`	Library versions
The Math	`random_state=42`	Algorithmic randomness
The Memory	TrackIO	What you tried & what worked

Reproducibility in Practice

Week 9: CS 203 - Software Tools and Techniques for AI

The Friday Night Nightmare

Three Problems, Three Solutions

The Machine

"Works on My Machine" — The Most Famous Lie in CS

The Shipping Container Analogy

Why Docker? Four Scenarios

Six Concepts — That's All

Image vs Container

Dockerfile, Image, Container — The Flow

Docker vs Virtual Machines

Docker vs VMs: The Numbers

Docker Hub: The App Store for Images

Docker Hub: Choosing the Right Image

The Dockerfile: Line by Line

Layer Caching: Why the Second Build Is Fast

Live Demos

Demo 1: Hello Docker

Demo 2: Reproducible Dependencies

Demo 3: Web App + Port Mapping

Demo 4: Volumes — Containers Have Amnesia

Demo 5: Environment Variables

Common Docker Errors

The 6 Docker Commands You Need

The Math

"Yesterday 92%, Today 85%, Same Code?!"

The Minecraft Seed

How to Lock the Universe

Seeds: The Limits

The Memory

The Spreadsheet Graveyard

TrackIO: Three Calls Is All You Need

TrackIO: Training Curves

TrackIO: Comparing Runs

TrackIO: Images & Tables

TrackIO: Alerts for Overfitting

TrackIO: The Dashboard

The Complete Reproducibility Stack

Key Takeaways