Supervised Learning

Deep Dive

Linear & Logistic Regression

Nipun Batra | IIT Gandhinagar

Learning Goals

By the end of this lecture, you will:

  1. Understand how linear regression finds the best line
  2. Learn how to find optimal weights (optimization!)
  3. Apply logistic regression for classification
  4. Connect sklearn to PyTorch for neural networks

Recap: Supervised Learning

We have:

  • Features (x): What we know about each example
  • Labels (y): What we want to predict

Goal: Learn a function f where ŷ = f(x) ≈ y

If y is... Task Example
A number Regression Predict house price
A category Classification Spam or not spam

Part 1: Linear Regression

Finding the Best Line

The Simplest Prediction Problem

Scenario: You're a real estate agent. A client asks:

"I'm looking at a 1750 sqft house. What should I expect to pay?"

You have data from recent sales:

Size (sqft) Price (₹ lakhs)
1000 40
1500 60
2000 80
2500 100

Can you see the pattern?

Visualizing the Data

When we plot the data:

  • X-axis: House size
  • Y-axis: Price

The points seem to follow a line!

Linear regression = finding the best line through the points

The Pattern is Clear!

Every 500 sqft adds ₹20 lakhs.

Size Price Pattern
1000 40
1500 60 +500 sqft → +₹20 lakhs
2000 80 +500 sqft → +₹20 lakhs
2500 100 +500 sqft → +₹20 lakhs

So 1750 sqft should cost... ₹70 lakhs!

You just did linear regression in your head.

The Equation of a Line

ŷ = wx + b

Symbol Name Meaning Our Example
x Input Feature value Size (sqft)
ŷ Output Predicted value Price
w Weight Slope 0.04
b Bias Intercept 0

The "hat" on y means it's our prediction!

What Does the Weight Mean?

Weight w = 0.04 means:

"For every 1 sqft increase, price increases by ₹0.04 lakhs"

Or equivalently:

"For every 100 sqft increase, price increases by ₹4 lakhs"

The weight tells you the sensitivity — how much does output change when input changes?

What Does the Bias Mean?

Bias b = 0 means:

"A 0 sqft house would cost ₹0"

In reality, bias captures the baseline cost:

  • Land value
  • Permits and fees
  • Minimum construction cost

If b = 10, then even a tiny house costs at least ₹10 lakhs.

Multiple Features: The General Form

What if price depends on more than just size?

ŷ = w₁x₁ + w₂x₂ + … + w_d x_d + b

Or in vector form (both notations are equivalent):

ŷ = w·x + b = wᵀx + b

Symbol Shape Example
x (d,) [1500, 3, 2] — size, beds, baths
w (d,) [0.03, 5.0, 8.0] — learned weights
b scalar -10

Note: w·x and wᵀx both mean dot product (sum of element-wise products)

Notation: Absorbing Bias into θ

Going forward, we combine weights and bias into one vector θ:

Trick: Add a column of 1s to x, so x₀ = 1 and θ₀ = b (the bias)

Original Augmented
ŷ = wᵀx + b ŷ = θᵀx

Now θ contains bias + weights!
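A quick numpy check of this trick, using the example feature and weight values from the tables above:

```python
import numpy as np

# Original formulation: y_hat = w . x + b
x = np.array([1500., 3., 2.])        # size, beds, baths
w = np.array([0.03, 5.0, 8.0])
b = -10.0
y_hat = w @ x + b

# Augmented formulation: prepend a 1 to x and fold b into theta
x_aug = np.concatenate(([1.0], x))   # [1, size, beds, baths]
theta = np.concatenate(([b], w))     # [b, w1, w2, w3]
y_hat_aug = theta @ x_aug

print(round(y_hat, 2), round(y_hat_aug, 2))  # 66.0 66.0 — identical predictions
```

Same prediction either way; the bias has simply become the weight on a constant feature.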

Interpreting Multiple Weights

# After training on multiple features:
# coef_ = [0.03, 5.0, 8.0]
# intercept_ = -10
Feature Weight Interpretation
Size (sqft) 0.03 +100 sqft → +₹3 lakhs
Bedrooms 5.0 +1 bedroom → +₹5 lakhs
Bathrooms 8.0 +1 bathroom → +₹8 lakhs

Each weight shows that feature's independent contribution to price!

Part 2: Finding the Best Weights

The Optimization Problem

But What if Data Isn't Perfect?

Real data has noise — points don't fall exactly on a line.

Three candidate lines:

Line Equation
A
B
C

Which line is "best"?
The one with the smallest total error!

The Goal: Minimize Errors

Residual = Actual - Predicted = y - ŷ

Size Actual Predicted Residual Residual²
1000 42 40 +2 4
1500 58 60 -2 4
2000 83 80 +3 9
2500 97 100 -3 9

Goal: Find w, b that minimize Σᵢ (yᵢ - ŷᵢ)²
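The table's arithmetic, verified in numpy:

```python
import numpy as np

# Actual and predicted prices from the residuals table above
actual    = np.array([42., 58., 83., 97.])
predicted = np.array([40., 60., 80., 100.])

residuals = actual - predicted      # [ 2., -2.,  3., -3.]
sse = np.sum(residuals ** 2)        # 4 + 4 + 9 + 9 = 26
mse = np.mean(residuals ** 2)       # 26 / 4 = 6.5

print(sse, mse)  # 26.0 6.5
```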

Why Squared Errors?

We minimize Sum of Squared Errors (SSE):

SSE = Σᵢ (yᵢ - ŷᵢ)²

Why Square? Reason
Errors don't cancel +3 and -3 both contribute positively
Penalizes big errors more Error of 10 costs 100, not 10
Has nice math properties Differentiable, convex

Mean Squared Error (MSE)

More commonly, we use MSE (average of squared errors):

MSE = (1/n) Σᵢ (yᵢ - ŷᵢ)²

This is also called the Loss Function or Cost Function:

L(θ) = (1/n) Σᵢ (yᵢ - θᵀxᵢ)²

Our goal: Find θ that minimizes L(θ)

Two Ways to Find the Best Weights

Method How It Works When to Use
Normal Equation Direct formula, one-shot Small datasets
Gradient Descent Iterative, step-by-step Large datasets, neural nets

Let's learn both!

Quick Review: Derivatives and Gradients

Concept What it means Example
Derivative Rate of change (1 variable) f(x) = x² → f'(x) = 2x
Partial Derivative Rate of change w.r.t. one variable (others fixed) ∂f/∂x, holding y fixed
Gradient Vector of all partial derivatives ∇f = [∂f/∂x, ∂f/∂y]

Key insight: Gradient points in direction of steepest increase. To minimize, go opposite to gradient!

Gradient Example: f(x, y) = x² + 2y²

∇f = [∂f/∂x, ∂f/∂y] = [2x, 4y]

At point (1, 1):

∇f(1, 1) = [2, 4]

Interpretation:

  • Moving in the x direction increases f at rate 2
  • Moving in the y direction increases f at rate 4
  • To decrease f, move in the -∇f direction

Setting ∇f = 0: Gives us (x, y) = (0, 0) — the minimum!
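A short sketch that repeatedly steps opposite the gradient of f(x, y) = x² + 2y², starting from (1, 1):

```python
# Gradient of f(x, y) = x^2 + 2y^2
def grad(x, y):
    return 2 * x, 4 * y   # [df/dx, df/dy]

x, y, lr = 1.0, 1.0, 0.1
for _ in range(100):
    gx, gy = grad(x, y)
    x, y = x - lr * gx, y - lr * gy  # step opposite the gradient

print(round(x, 6), round(y, 6))  # 0.0 0.0 — arrived at the minimum
```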

Can We Solve It Directly?

For simple functions like f(x) = x², we know the trick:

Step What we do Result
1. Write the function f(x) = x²
2. Take derivative f'(x) = 2x
3. Set = 0 2x = 0
4. Solve x = 0 → Minimum!

Can we do the same for linear regression?

Yes! Our loss is L(θ) = (1/n) ||y - Xθ||²

Take derivative, set to 0, solve...

The Normal Equation

Setting ∇L(θ) = 0 and solving gives us:

θ = (XᵀX)⁻¹ Xᵀ y

How to read this: "The best parameters = some matrix math on X and y"

import numpy as np

X = np.array([[1, 1000], [1, 1500], [1, 2000], [1, 2500]])  # column of 1s for bias
y = np.array([42, 58, 83, 97])

theta = np.linalg.inv(X.T @ X) @ X.T @ y
print(f"bias = {theta[0]:.1f}, weight = {theta[1]:.4f}")
# bias = 3.5, weight = 0.038  → Line B from our plot!

Limitation: Requires matrix inversion — too slow for millions of features. We need gradient descent!

Gradient Descent

The idea: Take small steps downhill until you reach the minimum!

  1. Start with random θ → 2. Compute gradient → 3. Step opposite → 4. Repeat!

Gradient Descent: The Algorithm

Update rule:

θ ← θ - α ∇L(θ)

Symbol Name Meaning
α Learning rate How big each step is
∇L(θ) Gradient Direction of steepest increase
-∇L(θ) Negative gradient Direction of steepest decrease

Gradient Descent: A Worked Example

Let's walk through ONE step:

Current Value
Weight w 0.01
Loss L(w) 150
Gradient dL/dw -80
Learning rate α 0.001

The update:

w ← w - α × gradient = 0.01 - 0.001 × (-80) = 0.01 + 0.08 = 0.09

Gradient was negative → we moved the weight UP!
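The same single update, checked in code with the table's numbers:

```python
# One gradient-descent step: w <- w - lr * gradient
w, lr, gradient = 0.01, 0.001, -80.0

w_new = w - lr * gradient   # 0.01 - 0.001 * (-80) = 0.01 + 0.08
print(round(w_new, 4))      # 0.09 — the weight moved UP
```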

The Gradient for MSE: Intuition

How should we nudge w to reduce the error?

Think about one data point with prediction ŷ = wx:

Situation Error (y - ŷ) What to do with w?
Predicted too low y - ŷ > 0 Increase w (to predict higher)
Predicted too high y - ŷ < 0 Decrease w (to predict lower)
Predicted correctly y - ŷ = 0 Don't change!

The gradient captures exactly this: error × feature value

Big error + big feature = big update. Zero error = no update.

Gradient Descent in NumPy

def gradient_descent(X, y, lr=0.01, epochs=1000):
    theta = np.zeros(X.shape[1])  # Start with zeros

    for epoch in range(epochs):
        y_pred = X @ theta             # Predictions
        error = y - y_pred             # Residuals
        gradient = (-2/len(y)) * (X.T @ error)
        theta = theta - lr * gradient  # Update!

    return theta

Just 8 lines of code! This is all of gradient descent.
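A possible run on the noisy housing data (the function is repeated so the snippet stands alone; sizes are rescaled to thousands of sqft, since raw sqft values would need an impractically tiny learning rate):

```python
import numpy as np

def gradient_descent(X, y, lr=0.01, epochs=1000):
    theta = np.zeros(X.shape[1])  # Start with zeros
    for epoch in range(epochs):
        y_pred = X @ theta             # Predictions
        error = y - y_pred             # Residuals
        gradient = (-2 / len(y)) * (X.T @ error)
        theta = theta - lr * gradient  # Update!
    return theta

# Column of 1s for the bias; sizes in THOUSANDS of sqft
X = np.array([[1, 1.0], [1, 1.5], [1, 2.0], [1, 2.5]])
y = np.array([42., 58., 83., 97.])

theta = gradient_descent(X, y, lr=0.1, epochs=5000)
print(np.round(theta, 2))  # ≈ [3.5, 38.0] — i.e. 0.038 per sqft, matching the normal equation
```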

Learning Rate: The Key Hyperparameter

Too small (α = 0.05) Just right (α = 0.3) Too large (α = 1.1)
Slow convergence Fast convergence ✓ Diverges!

Learning Rate: The Hill Analogy

Imagine walking down a hill blindfolded:

Learning Rate What Happens
Tiny steps (0.0001) Safe, but takes forever to reach bottom
Normal steps (0.01) Good progress, reach bottom reasonably
Giant leaps (1.0) Overshoot, end up on the other side!

How to choose?

  • Start with 0.001 or 0.01
  • If loss doesn't decrease → try smaller
  • If loss explodes (NaN) → definitely smaller!
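A minimal illustration of the two regimes on f(w) = w², whose gradient is 2w:

```python
# Run gradient descent on f(w) = w^2 with a given learning rate
def descend(lr, steps=30, w=1.0):
    for _ in range(steps):
        w = w - lr * 2 * w   # gradient of w^2 is 2w
    return w

print(round(descend(0.1), 4))  # 0.0012 — shrinking toward the minimum at 0
print(round(descend(1.1), 1))  # 237.4 — each step overshoots, |w| grows: diverged!
```

With lr = 0.1 each step multiplies w by 0.8; with lr = 1.1 it multiplies w by -1.2, so the iterate bounces across the minimum and grows without bound.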

Why Gradient Descent Matters

Normal Equation Gradient Descent
One-shot computation Iterative process
Exact solution Approximate (but close enough)
O(n³) complexity O(n) per iteration
Only works for linear models Works for ANY differentiable model!

This is the foundation of neural network training!

Part 3: Feature Scaling

Why Scale Matters

The Scaling Problem

Different features have different scales:

Feature Range Scale
House size 500 - 5000 sqft ~1000s
Bedrooms 1 - 6 ~1s
Age 0 - 100 years ~10s

Problem: Large-scale features dominate gradient descent!

Why Scaling Helps Gradient Descent

Without scaling:

  • Size weight needs tiny updates (large values)
  • Bedroom weight needs large updates (small values)
  • Gradient descent zigzags inefficiently!

With scaling:

  • All features contribute equally
  • Gradient descent converges faster
  • More stable training

Scaling: The Currency Analogy

Imagine training with mixed currencies:

Feature Value Scale
Price in rupees 50,00,000 Millions
Number of rooms 3 Single digits

Without scaling: The model thinks rupees matter MORE (bigger numbers!).

Standardization: Convert everything to "standard units"

  • "This house is 2 std devs above average in price"
  • "This house has 1 std dev above average rooms"

Now both features speak the same language!

Two Common Scaling Methods

Method Formula Result
Standardization x' = (x - μ) / σ Mean=0, Std=1
Min-Max Scaling x' = (x - min) / (max - min) Range [0, 1]
from sklearn.preprocessing import StandardScaler, MinMaxScaler

# Standardization (most common)
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Min-Max (when you need bounded range)
scaler = MinMaxScaler()
X_scaled = scaler.fit_transform(X)
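The same standardization written out by hand in numpy (made-up feature values), showing what StandardScaler's fit_transform computes:

```python
import numpy as np

# Manual standardization: x' = (x - mean) / std, per feature (column)
X = np.array([[1000., 2.], [1500., 3.], [2000., 3.], [2500., 4.]])

mu = X.mean(axis=0)       # per-column mean
sigma = X.std(axis=0)     # per-column std (population std, as sklearn uses)
X_scaled = (X - mu) / sigma

print(np.round(X_scaled.mean(axis=0), 6))  # ≈ [0, 0]
print(np.round(X_scaled.std(axis=0), 6))   # ≈ [1, 1]
```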

Important: Fit on Train, Transform Both!

# CORRECT way:
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # Fit AND transform
X_test_scaled = scaler.transform(X_test)         # Only transform!

# WRONG (data leakage!):
X_scaled = scaler.fit_transform(X)  # Fitting on all data

Never fit the scaler on test data! It would leak information.

When to Scale?

Algorithm Needs Scaling? Why
Linear/Logistic Regression Yes Gradient descent
Neural Networks Yes Gradient descent
Decision Trees No Split-based, scale-invariant
K-Nearest Neighbors Yes Distance-based
Random Forest No Tree-based

Part 4: From sklearn to PyTorch

Building the Bridge

Linear Regression in sklearn

from sklearn.linear_model import LinearRegression
import numpy as np

# Our data
X = np.array([[1000], [1500], [2000], [2500]])
y = np.array([40, 60, 80, 100])

# Create and train model
model = LinearRegression()
model.fit(X, y)

# Now predict!
model.predict([[1750]])  # → 70.0 (₹70 lakhs)

Understanding What sklearn Learned

print(f"Weight (w): {model.coef_[0]}")      # 0.04
print(f"Intercept (b): {model.intercept_}")  # 0.0

The equation it learned:

price = 0.04 × size + 0

Verify:

  • 1750 sqft → 0.04 × 1750 + 0 = ₹70 lakhs

The Same Thing in PyTorch!

import torch
import torch.nn as nn

# Data as tensors
X = torch.tensor([[1000.], [1500.], [2000.], [2500.]])
y = torch.tensor([[40.], [60.], [80.], [100.]])

# Normalize for stable training
X_norm = X / 1000

# Linear model: y = wx + b
model = nn.Linear(1, 1)  # 1 input, 1 output

Training in PyTorch

criterion = nn.MSELoss()                              # Loss function
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)  # Optimizer

for epoch in range(100):
    y_pred = model(X_norm)      # 1. Forward pass
    loss = criterion(y_pred, y) # 2. Compute loss
    optimizer.zero_grad()       # 3. Clear gradients
    loss.backward()             # 4. Compute gradients
    optimizer.step()            # 5. Update weights

After training: model.weight ≈ 40 on the normalized inputs (40 per 1000 sqft = 0.04 per sqft), model.bias ≈ 0 — the same line as sklearn!

The PyTorch Training Loop

The 5-step training cycle (memorize this!):

Step Code What it does
1 y_pred = model(X) Forward pass: compute predictions
2 loss = criterion(y_pred, y) Measure error
3 optimizer.zero_grad() Clear old gradients
4 loss.backward() Compute new gradients
5 optimizer.step() Update θ using gradients

This exact loop works for ANY neural network - from linear regression to GPT!

Comparing sklearn vs PyTorch

Aspect sklearn PyTorch
Simplicity 2 lines of code 10+ lines
Method Closed-form (SVD) Gradient descent
Customization Limited Full control
Neural nets Basic only Full support
GPU support No Yes!

Start with sklearn, move to PyTorch when you need more power!

Part 5: Logistic Regression

From Numbers to Categories

A Different Problem

Scenario: You're building a spam filter.

Email Exclamation marks Has "FREE" Is Spam?
1 5 Yes Spam
2 0 No Not Spam
3 3 Yes Spam
4 1 No Not Spam

The output is a category, not a number!

Why Can't We Use Linear Regression?

If we use linear regression:

score = wᵀx + b

Problem: This gives any number (-∞ to +∞)

Score What does it mean?
-2.5 ???
0.3 ???
1.5 ???
147 ???

We need something between 0 and 1 (a probability)!

The Sigmoid Function

Solution: Squash any number to the range (0, 1)

σ(z) = 1 / (1 + e⁻ᶻ)

The Sigmoid Shape

The sigmoid is an S-curve:

Region Behavior
Very negative z Output ≈ 0
z near 0 Output changes rapidly (decision boundary)
Very positive z Output ≈ 1

Key insight: It converts any number to a probability!

Why Sigmoid? Let's Verify!

Plug in some numbers:

Linear score z σ(z) Meaning
z = -5 0.007 ~0% chance
z = 0 0.5 50/50
z = +5 0.993 ~100% chance

No matter what z is, the output is always between 0 and 1!
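Verifying these values with a few lines of Python:

```python
import math

def sigmoid(z):
    # Squash any real number into (0, 1)
    return 1 / (1 + math.exp(-z))

for z in (-5, 0, 5):
    print(z, round(sigmoid(z), 4))
# -5 0.0067
#  0 0.5
#  5 0.9933
```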

Logistic Regression Model

Two steps:

  1. Linear: Compute a score (same as linear regression!)

     z = wᵀx + b

  2. Sigmoid: Convert to probability

     P(spam) = σ(z)

A Concrete Example

Email features: 5 exclamation marks, has "FREE" (=1)

Suppose the learned weights are w₁ = 0.8, w₂ = 1.5, b = -2 (illustrative values)

Step 1: Linear score

z = 0.8 × 5 + 1.5 × 1 - 2 = 3.5

Step 2: Sigmoid

σ(3.5) ≈ 0.97

Decision: 97% → This is spam!
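Checking the arithmetic in code. The weights here (w₁ = 0.8, w₂ = 1.5, b = -2) are assumed, illustrative values chosen so the result lands at 97%:

```python
import math

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

# Illustrative (assumed) weights for [exclamation_marks, has_FREE]
w = [0.8, 1.5]
b = -2.0
x = [5, 1]   # 5 exclamation marks, has "FREE"

z = w[0] * x[0] + w[1] * x[1] + b   # 4.0 + 1.5 - 2.0 = 3.5
p_spam = sigmoid(z)
print(round(p_spam, 2))  # 0.97 → spam!
```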

The Decision Rule

If P(spam) Decision Threshold can be tuned!
> 0.5 Predict SPAM Lower → catch more spam
≤ 0.5 Predict NOT SPAM Higher → fewer false alarms

Logistic Regression in sklearn

from sklearn.linear_model import LogisticRegression

X = [[5, 1], [0, 0], [3, 1], [1, 0]]  # [exclamations, has_FREE]
y = [1, 0, 1, 0]                       # 1=spam, 0=not spam

model = LogisticRegression()
model.fit(X, y)

model.predict([[4, 1]])       # → [1] (spam)
model.predict_proba([[4, 1]]) # → [[0.12, 0.88]] = [P(not spam), P(spam)]

Logistic Regression in PyTorch

import torch
import torch.nn as nn

# Model: Linear + Sigmoid
class LogisticRegression(nn.Module):
    def __init__(self, input_dim):
        super().__init__()
        self.linear = nn.Linear(input_dim, 1)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        return self.sigmoid(self.linear(x))

model = LogisticRegression(input_dim=2)

Training Logistic Regression

# Binary Cross-Entropy Loss (for classification)
# Assumes X, y are float tensors (y of shape (n, 1)) from the earlier spam data
criterion = nn.BCELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

for epoch in range(100):
    # Forward pass
    y_pred = model(X)

    # Compute loss (cross-entropy, not MSE!)
    loss = criterion(y_pred, y)

    # Backward pass
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

Why Cross-Entropy Loss?

For classification, MSE doesn't work well — Cross-Entropy penalizes confident wrong predictions severely!

BCE = -[y log(p) + (1 - y) log(1 - p)]

Cross-Entropy: The 4 Cases

Actual y Predicted p Loss
1 0.99 ≈ 0.01 (confident and right)
1 0.01 ≈ 4.6 (confident and WRONG!)
0 0.01 ≈ 0.01 (confident and right)
0 0.99 ≈ 4.6 (confident and WRONG!)

Key insight: Being confident AND wrong = very high loss!
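The four cases can be computed directly from the binary cross-entropy formula, loss = -[y log p + (1 - y) log(1 - p)]:

```python
import math

def bce(y, p):
    # Binary cross-entropy for a single prediction p of label y
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

print(round(bce(1, 0.99), 2))  # 0.01 — confident and right: tiny loss
print(round(bce(1, 0.01), 2))  # 4.61 — confident and WRONG: huge loss
print(round(bce(0, 0.01), 2))  # 0.01 — confident and right: tiny loss
print(round(bce(0, 0.99), 2))  # 4.61 — confident and WRONG: huge loss
```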

Part 6: Feature Engineering

Making Linear Models More Powerful

The Limitation of Linear Models

Problem: What if the relationship isn't linear?

Example: Ice cream sales vs temperature — clearly not a straight line!

Solution: Transform the inputs using basis functions

Basis Functions: The Key Idea

Instead of:

ŷ = wx + b

Use:

ŷ = w₁x + w₂x² + w₃x³ + … + b

Original Feature Basis-Expanded Features
x [x, x², x³, …]

The model is still linear in w! (just not in x)

Feature Expansion Visualized

By adding x² as a feature, we give the model more "raw material" to work with!

Polynomial Features in sklearn

from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

X = np.array([[15], [20], [25], [30], [35]])  # Temperature
y = np.array([15, 10, 20, 55, 120])           # Ice cream sales

poly = PolynomialFeatures(degree=2)  # Transform x → [1, x, x²]
X_poly = poly.fit_transform(X)

model = LinearRegression()
model.fit(X_poly, y)                 # Now it can fit curves!

Works for Classification Too!

Same idea with Logistic Regression:

from sklearn.linear_model import LogisticRegression

poly = PolynomialFeatures(degree=2)
X_poly = poly.fit_transform(X)

model = LogisticRegression()
model.fit(X_poly, y)  # Now decision boundary can be curved!

Key insight: Basis functions let linear models learn nonlinear patterns!

Summary: The Big Picture

Concept Key Takeaway
Linear Regression ŷ = wᵀx + b fits a line through data
Loss Function MSE measures how wrong we are
Gradient Descent θ ← θ - α∇L(θ) steps downhill iteratively
Logistic Regression Linear + Sigmoid for classification
Cross-Entropy Loss for classification
Basis Functions Transform inputs for nonlinear patterns

Key Takeaways

  1. Linear Regression fits a line through data

    • Weight = sensitivity (how much output changes per input)
    • Minimize squared errors
  2. Two ways to find optimal weights

    • Normal equation (direct)
    • Gradient descent (iterative) — foundation of deep learning!
  3. Logistic Regression classifies using the sigmoid

    • Converts any score to probability (0-1)
  4. sklearn → PyTorch uses the same concepts!

You Now Understand the Basics!

What You Learned Why It Matters
Linear Regression Foundation of all neural networks
Gradient Descent How we train ANY model
Logistic Regression Classification with probabilities
Basis Functions Make linear models powerful

The training loop you learned today is the SAME loop used to train GPT-4!

Next: Model Selection & Evaluation — How do we know if our model is good?