Generator in PyTorch · DCGAN

class Generator(nn.Module):
    def __init__(self, nz=100, ngf=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(nz, ngf*8, 4, 1, 0, bias=False),
            nn.BatchNorm2d(ngf*8), nn.ReLU(True),

            nn.ConvTranspose2d(ngf*8, ngf*4, 4, 2, 1, bias=False),
            nn.BatchNorm2d(ngf*4), nn.ReLU(True),

            nn.ConvTranspose2d(ngf*4, ngf*2, 4, 2, 1, bias=False),
            nn.BatchNorm2d(ngf*2), nn.ReLU(True),

            nn.ConvTranspose2d(ngf*2, 3, 4, 2, 1, bias=False),
            nn.Tanh()                                    # output in [-1, 1]
        )

    def forward(self, z):
        return self.net(z.view(z.size(0), -1, 1, 1))

Noise shape: [batch, 100] → reshaped to [batch, 100, 1, 1] → upsampled to [batch, 3, 32, 32].

Year	Model	What it did
2014	GAN	original paper — 28×28 MNIST
2015	DCGAN	convolutional, stable training
2017	WGAN-GP	Wasserstein + gradient penalty
2017	ProGAN	progressive growing → 1024×1024 faces
2019	StyleGAN	disentangled latent, hyper-realistic faces
2021	StyleGAN3	temporal consistency, aliasing fixes

	GANs	Diffusion
Training stability	✗ brittle	✓ stable
Sample quality	✓ (SOTA 2018-2020)	✓✓ (SOTA 2021+)
Likelihood	✗	≈
Inference speed	✓✓ one pass	✗ many passes
Diversity	mode collapse risk	✓ natural
Latent space	rich, explorable	uniform-ish

GANs

Lecture 20 · ES 667: Deep Learning

Learning outcomes

Where we are

PART 1

Why a new paradigm?

What "generate" even means

The generation task · picture

Why not just fit a Gaussian?

The deep-generative idea

The 2014 insight · use a classifier

The forger-and-detective analogy

The GAN pipeline

Two networks, one game

A 1D toy · watch G learn a bimodal target

PART 2

The minimax objective

Minimax · derived from binary cross-entropy

Worked numeric · a single step

optional · deriving the optimal D

Alternating updates · the training loop

Why .detach() matters

G's gradient · saturating vs non-saturating

The non-saturating trick

Worked numeric · the gradient gap

Why GAN training is fundamentally hard

PART 3

DCGAN · the architecture that worked

Before DCGAN · GANs barely trained

DCGAN · architecture at a glance

DCGAN · why these specific tricks?

DCGAN · five architectural guidelines

Transposed convolution · upsampling primitive

Why BN in both networks

Generator in PyTorch · DCGAN

Discriminator in PyTorch · DCGAN

Hyperparameter recipe that works

PART 4

Training instability & mode collapse

Why GANs are hard to train

Mode collapse visually

Why mode collapse happens · mechanism

Fixing mode collapse · the toolbox

Diagnosing GAN health

FID in one paragraph

PART 5

WGAN · Wasserstein distance

The problem with JS

Wasserstein distance · the picture

Earth-mover distance · intuition

WGAN · critic with a speed limit

WGAN-GP · highway-patrol analogy

WGAN-GP · the gradient penalty, term by term

Why WGAN fixed everything

PART 6

StyleGAN · the GAN peak

What StyleGAN changed

StyleGAN · hierarchical style injection

StyleGAN · the latent hierarchy

GAN applications · the 2017-2021 peak

PART 7

GANs in 2026

The GAN era · 2014-2020

After GANs · diffusion took over

Why diffusion won · the real reasons

When to still reach for a GAN

Adversarial thinking beyond GANs

Lecture 20 — summary

Read before Lecture 21

Next lecture

Why `.detach()` matters