Reproducibility & Environments

Week 8 · CS 203: Software Tools and Techniques for AI

Prof. Nipun Batra
IIT Gandhinagar

The "Works on My Machine" Problem

You built a Netflix movie predictor. It works great on your laptop.

Your friend tries to run it:

ModuleNotFoundError: No module named 'sklearn'

You say: "Just pip install sklearn"

ERROR: Could not find a version that satisfies the requirement sklearn

(The PyPI package is actually named scikit-learn, not sklearn!)

3 hours later: Still debugging Python versions, missing dependencies...

Sound familiar?

Why Reproducibility Matters

For you:

  • 6 months later, you can still run your own code
  • Switch laptops without days of setup
  • Debug issues consistently

For collaboration:

  • Teammates can run your code immediately
  • No more "but it works for me!"
  • Onboard new team members quickly

For science:

  • Others can verify your results
  • Build on your work
  • Trust in research

The Reproducibility Spectrum

Your code is only as good as its ability to run elsewhere. If no one else can run it, it might as well not exist. Reproducibility isn't about being fancy - it's about being useful.

                    Reproducibility Spectrum

Not Reproducible                                    Fully Reproducible
     |                                                     |
     v                                                     v
[Just code] → [+ README] → [+ requirements.txt] → [+ Docker] → [+ CI/CD]
     ↓            ↓               ↓                    ↓            ↓
 "What?"    "Maybe..."      "Probably!"          "Definitely"   "Automated"

Today's goal: Get you to "Probably!" or better.

Connection to Our Netflix Project

Week 1-7: Built a movie success predictor
          ↓
Week 8:   Make it reproducible!
          - Anyone can run your code
          - Same results every time
          - Works on any machine

Goal: Package our Netflix project so anyone can use it.

Part 1: Virtual Environments

Keeping projects separate

The Problem: Dependency Conflicts

Scenario:

Project              Python   TensorFlow   NumPy
Netflix Predictor    3.10     2.12         1.24
Old School Project   3.8      1.15         1.19
Your System          3.11     ???          ???

Can't install both TensorFlow versions on the same system!

Solution: Give each project its own isolated environment.

Virtual Environments: The Concept

Think of it like separate rooms in a house:

Your Computer
├── Project A's Room
│   └── Python 3.10, TensorFlow 2.12, NumPy 1.24
│
├── Project B's Room
│   └── Python 3.8, TensorFlow 1.15, NumPy 1.19
│
└── Living Room (system Python)
    └── Python 3.11 (don't touch this!)

Each room has its own stuff. No conflicts!

Creating a Virtual Environment

Step 1: Create the environment

python -m venv netflix_env

Step 2: Activate it

# Mac/Linux
source netflix_env/bin/activate

# Windows
netflix_env\Scripts\activate

Step 3: Your prompt changes

(netflix_env) $ python --version
Python 3.10.12

Now you're in the Netflix room!

Installing Packages in Your Environment

With the environment activated:

# Install what you need
pip install pandas scikit-learn matplotlib

# Check what's installed
pip list

# When done, deactivate
deactivate

Key insight: Packages only install in the active environment.

Your system Python stays clean!
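
Not sure which environment is active? A quick check (the output path will be wherever you created the venv):

(netflix_env) $ python -c "import sys; print(sys.prefix)"
/path/to/netflix_env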

requirements.txt: Your Shopping List

Save your dependencies:

pip freeze > requirements.txt

What it creates:

numpy==1.24.3
pandas==2.0.2
scikit-learn==1.2.2
matplotlib==3.7.1

Anyone can now install exactly what you have:

pip install -r requirements.txt

Good vs Bad requirements.txt

Good (pinned versions):

numpy==1.24.3
pandas==2.0.2
scikit-learn==1.2.2

Bad (unpinned):

numpy
pandas
scikit-learn

Why? Tomorrow, scikit-learn 2.0 releases with breaking changes. Your code breaks for new users, but not for you.

Pin your versions for reproducibility!
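
For reference, pip also understands looser version specifiers (standard pip syntax; exact pins remain the safest for reproducibility):

# Exact pin: most reproducible
numpy==1.24.3

# Compatible release: any 1.24.x at or above 1.24.3
numpy~=1.24.3

# Minimum only: risky, a future release may break your code
numpy>=1.24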

Conda: An Alternative

Conda is popular in data science. It can manage:

  • Python versions (not just packages)
  • Non-Python dependencies (CUDA, C libraries)

# Create environment with specific Python
conda create -n netflix python=3.10

# Activate
conda activate netflix

# Install packages
conda install pandas scikit-learn

# Export environment
conda env export > environment.yml

# Create from file
conda env create -f environment.yml
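
For comparison, a minimal hand-written environment.yml looks like this (conda env export produces a much longer, fully pinned version):

name: netflix
channels:
  - defaults
dependencies:
  - python=3.10
  - pandas
  - scikit-learn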

venv vs Conda: Which to Use?

Feature                  venv               Conda
Built into Python        Yes                No (install separately)
Manage Python versions   No                 Yes
Non-Python packages      No                 Yes (CUDA, etc.)
Speed                    Fast               Slower
File                     requirements.txt   environment.yml

Recommendation for this course: Start with venv (simpler).

Use Conda when you need GPU/CUDA setup.

Part 2: Random Seeds

Getting the same results every time

The Randomness Problem

Run your Netflix model training twice:

from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier

X_train, X_test, y_train, y_test = train_test_split(X, y)
model = RandomForestClassifier()
model.fit(X_train, y_train)
print(model.score(X_test, y_test))

Run 1: 0.82
Run 2: 0.79
Run 3: 0.84

Which result do you report?

What's Random in ML?

Many operations use random numbers:

  1. Train/test split - which samples go where?
  2. Model initialization - starting weights
  3. Shuffling data - order during training
  4. Dropout - which neurons to drop
  5. Data augmentation - random transformations

Without control: Different results every run.

Setting Random Seeds

Simple fix: Seed the random number generators, so every run produces the same sequence.

import random
import numpy as np
from sklearn.model_selection import train_test_split

# Set the seed ONCE at the start
random.seed(42)
np.random.seed(42)

# Now this split is reproducible
X_train, X_test, y_train, y_test = train_test_split(
    X, y, random_state=42
)

Run it 100 times → Same split every time!
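
You can convince yourself with a tiny check: re-seeding replays the exact same random sequence.

import numpy as np

np.random.seed(42)
first = np.random.rand(3)

np.random.seed(42)      # re-seed: the sequence starts over
second = np.random.rand(3)

print(np.array_equal(first, second))  # True: identical numbers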

A Complete Seed Function

import random
import numpy as np

def set_seed(seed=42):
    """Set all random seeds for reproducibility."""
    random.seed(seed)
    np.random.seed(seed)

    # If using PyTorch
    try:
        import torch
        torch.manual_seed(seed)
        torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass

# Call at the start of every script
set_seed(42)

Why 42? It's a tradition (Hitchhiker's Guide to the Galaxy).

Any number works!
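
One caveat: seeds alone don't make GPU training fully deterministic. If you use PyTorch, it has flags that trade speed for determinism (a sketch; see the PyTorch reproducibility docs for your version):

import torch

torch.backends.cudnn.deterministic = True   # use deterministic cuDNN kernels
torch.backends.cudnn.benchmark = False      # disable nondeterministic autotuning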

Don't Forget random_state!

Many sklearn functions have a random_state parameter:

# Train/test split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Random Forest
model = RandomForestClassifier(
    n_estimators=100, random_state=42
)

# Cross-validation: cross_val_score has no random_state parameter
from sklearn.model_selection import cross_val_score
cross_val_score(model, X, y, cv=5, random_state=42)  # ❌ No! TypeError

# Use a seeded KFold instead
from sklearn.model_selection import KFold
kf = KFold(n_splits=5, shuffle=True, random_state=42)
cross_val_score(model, X, y, cv=kf)  # ✓ Yes!

Part 3: Docker Basics

"Works on my machine" → "Works on EVERY machine"

Virtual Environments Aren't Enough

Scenario: You share your requirements.txt, but...

  • Friend has different OS (Windows vs Mac vs Linux)
  • System libraries differ
  • CUDA versions conflict
  • Even PATH configurations vary

Virtual environments isolate Python, not the whole system.

Docker: Package Everything

Docker creates a container with:

  • Operating system
  • Python version
  • All libraries
  • Your code
  • Configuration

It's like shipping your entire laptop to someone!

Your Code + Python + Linux + Everything
            ↓
        Container
            ↓
    Runs identically everywhere

Docker Concepts

Term         What It Is                    Analogy
Image        Blueprint/template            Recipe
Container    Running instance              Cooked dish
Dockerfile   Instructions to build image   Recipe card
Registry     Store for images              Recipe book

Workflow:

Dockerfile → (build) → Image → (run) → Container

Your First Dockerfile

Create a file named Dockerfile (no extension):

# Start from a Python image
FROM python:3.10-slim

# Set working directory
WORKDIR /app

# Copy requirements first (for caching)
COPY requirements.txt .

# Install dependencies
RUN pip install -r requirements.txt

# Copy your code
COPY . .

# Command to run
CMD ["python", "train.py"]
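
Tip: COPY . . copies the entire build context, including venv/ and large data files. A .dockerignore file (same idea as .gitignore) keeps them out of the image. A sketch, assuming the project layout used in this lecture:

# .dockerignore
venv/
__pycache__/
data/
models/
.git/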

Building and Running

Build the image:

docker build -t netflix-predictor .

Run it:

docker run netflix-predictor

That's it! Your code runs in an isolated container.

Works on any machine with Docker installed.

Common Docker Commands

# Build image
docker build -t myapp .

# Run container
docker run myapp

# Run interactively (get a shell)
docker run -it myapp /bin/bash

# Share files between host and container
docker run -v $(pwd)/data:/app/data myapp

# See running containers
docker ps

# Stop a container
docker stop <container_id>

When to Use Docker

Use Docker when:

  • Sharing with others on different OS
  • Deploying to cloud/servers
  • Complex dependencies (CUDA, system libraries)
  • Team projects

Skip Docker when:

  • Personal projects on one machine
  • Quick prototyping
  • Simple pure-Python code

Start with venv + requirements.txt. Add Docker when needed.

Part 4: Project Structure

Organize for reproducibility

A Reproducible Project Structure

netflix-predictor/
├── data/
│   ├── raw/              # Original, never modified
│   └── processed/        # Cleaned data
├── models/               # Saved models
├── notebooks/            # Jupyter notebooks
├── src/                  # Source code
│   ├── data.py          # Data loading
│   ├── train.py         # Training script
│   └── predict.py       # Prediction script
├── requirements.txt      # Dependencies
├── README.md             # Documentation
├── .gitignore           # What to ignore in Git
└── config.yaml          # Configuration

The README: Your Project's Front Door

Every project needs a good README:

# Netflix Movie Predictor

Predicts movie success based on features.

## Setup

1. Create virtual environment:
   python -m venv venv
   source venv/bin/activate

2. Install dependencies:
   pip install -r requirements.txt

3. Download data:
   python src/download_data.py

## Usage

Train model:
python src/train.py

Make predictions:
python src/predict.py --input movie.csv

Configuration Files

Don't hardcode values in your code!

# Bad
learning_rate = 0.01
batch_size = 32
model_path = "/home/nipun/models/netflix.pkl"

Use a config file:

# config.yaml
training:
  learning_rate: 0.01
  batch_size: 32
  epochs: 100

paths:
  model: models/netflix.pkl
  data: data/processed/

Loading Config Files

import yaml  # third-party package: pip install pyyaml

def load_config(path="config.yaml"):
    with open(path) as f:
        return yaml.safe_load(f)

config = load_config()
print(config["training"]["learning_rate"])  # 0.01

Benefits:

  • Change settings without modifying code
  • Track configuration in Git
  • Different configs for dev/prod
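
For the dev/prod point, a common pattern is passing the config path on the command line. A minimal sketch using argparse from the standard library (config_dev.yaml is a hypothetical file name):

import argparse
import yaml

parser = argparse.ArgumentParser()
parser.add_argument("--config", default="config.yaml")
args = parser.parse_args()

# e.g. python src/train.py --config config_dev.yaml
with open(args.config) as f:
    config = yaml.safe_load(f)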

.gitignore: What NOT to Track

# Data files (too large for Git)
data/raw/
*.csv

# Models (too large)
models/*.pkl
*.pth

# Environment
venv/
__pycache__/

# Secrets
.env
secrets.yaml

# Jupyter checkpoints
.ipynb_checkpoints/

Part 5: Putting It Together

Reproducibility checklist

Reproducibility Checklist

Before sharing your project:

  • [ ] Virtual environment - venv or conda
  • [ ] requirements.txt - with pinned versions
  • [ ] Random seeds - set at script start
  • [ ] README - setup and usage instructions
  • [ ] Config file - no hardcoded values
  • [ ] .gitignore - exclude data/models
  • [ ] Test it - clone fresh and run (see the sketch after this list)
  • [ ] Docker (optional) - for complex setups
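
For the "Test it" item, a fresh-clone check might look like this (replace <your-repo-url> with your repository):

git clone <your-repo-url> /tmp/fresh-test
cd /tmp/fresh-test
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python src/train.py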

Quick Setup Script

Create setup.sh:

#!/bin/bash
set -e  # stop on the first failing step

# Create virtual environment
python -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Download data (if needed)
python src/download_data.py

echo "Setup complete! Run: source venv/bin/activate"

Now anyone can run: bash setup.sh

Netflix Project: Reproducibility

Let's apply this to our project:

netflix-predictor/
├── data/
│   └── movies.csv
├── src/
│   ├── train.py
│   └── predict.py
├── models/
│   └── .gitkeep
├── requirements.txt
├── config.yaml
├── README.md
├── .gitignore
└── setup.sh

Now anyone can reproduce our movie predictor!

Key Takeaways

  1. Virtual environments isolate project dependencies

    • Use venv or conda
    • Pin versions in requirements.txt
  2. Random seeds ensure reproducible results

    • Set at script start
    • Use random_state parameter
  3. Docker packages everything (when needed)

    • OS + Python + libraries + code
  4. Project structure matters

    • README, config, .gitignore
    • Separate code, data, models

Common Mistakes

  • Not pinning versions in requirements.txt
  • Forgetting random_state in train_test_split
  • Committing data/models to Git (use .gitignore!)
  • Hardcoding file paths ("/home/nipun/...")
  • No README (how do I run this?)
  • Testing only on your machine

The test: Can a friend run your code from scratch?

Lab Preview

This week's hands-on:

  1. Create a virtual environment for your Netflix project
  2. Generate requirements.txt with pinned versions
  3. Add random seeds to your training script
  4. Create a proper README
  5. Write a Dockerfile (optional bonus)
  6. Have a friend test your setup!

Interview Questions

Common interview questions on reproducibility:

  1. "How would you ensure your ML experiments are reproducible?"

    • Pin all dependency versions in requirements.txt
    • Set random seeds (Python, NumPy, PyTorch/TensorFlow)
    • Version control data with DVC or similar
    • Use config files instead of hardcoded values
    • Document environment (Python version, OS)
  2. "What is Docker and why use it for ML?"

    • Container packages code + dependencies + environment
    • "Works on my machine" → "Works everywhere"
    • Consistent dev/prod environments
    • Easy deployment and scaling

Questions?

Today's key concepts:

  • Virtual environments (venv, conda)
  • requirements.txt
  • Random seeds
  • Docker basics
  • Project structure

Remember: Reproducibility is a gift to your future self!