Messy Kitchen	Chef's Kitchen
`data.csv` in root folder	`data/raw/` and `data/processed/`
`train.py` next to `notes.txt`	`src/train.py`, `notebooks/explore.ipynb`
Hardcoded paths everywhere	`config.yaml` — one control panel
500 MB model committed to Git	`.gitignore` keeps it out

.gitignore — The Bouncer (Week 9 Recap)

Remember .gitignore from last week? It keeps junk out of Git.

Block This	Why
`data/`, `.csv`, `.h5`	Too large — Git is for code, not data
`models/.pkl`, `.pth`	Too large — save separately
`venv/`, `__pycache__/`	Generated — anyone can recreate
`.env`, `secrets.yaml`	Security risk — API keys, passwords
`.ipynb_checkpoints/`	Jupyter junk

data/
models/*.pkl
venv/
__pycache__/
.env
.ipynb_checkpoints/

Rule of thumb: if it's generated, large, or secret, it stays out.

Tool	Environment File	Manages Python Version?
venv (this course)	`requirements.txt`	No
Conda (data science)	`environment.yml`	Yes

Minecraft	Machine Learning
Seed number	`random_state=42`
Same seed = same world	Same seed = same split, same model
Share seed with friend = same map	Share seed with TA = same results

Reproducibility & Environments

Problem	Solution	One-liner
Messy folder, hardcoded paths	Project structure + `config.yaml`	Everything has a station
Dependency conflicts	`venv` + `requirements.txt`	Each project gets a soundproof studio
Different results each run	`random_state=42`	Same Minecraft seed = same world
Different OS, system libraries	Docker	Ship the whole container, not the parts list