Supplement · Weight Initialization

Weight Initialization in PyTorch

Colab Notebook · ~50 min

Google Colab Notebook

Python · ~50 min

Lab Objectives

Apply every torch.nn.init function from scratch and verify the resulting mean, variance, and shape of initialized tensors

Run a variance propagation experiment: measure activation variance at every layer of a 20-layer MLP under zeros_, N(0,1), Xavier, and He initialization

Demonstrate the symmetry problem empirically: train an MLP with all-zero weights and show that all neurons remain identical throughout training

Compare convergence speed and final accuracy of Xavier vs He initialization on a 10-layer ReLU MLP trained on MNIST

Implement orthogonal initialization for the hidden-to-hidden weights of a vanilla RNN and show numerically that W_hh^T @ W_hh ≈ I

Lab Overview

This notebook makes every formula from the readings concrete and experimentally verifiable. You will instrument forward passes to measure activation statistics, compare initialization strategies on real training runs, and build intuition for why the right starting point matters.

Sections

Section	Content
1	All torch.nn.init functions — apply each, verify with `.mean()`, `.var()`, shape checks
2	Variance propagation experiment — 20-layer linear network, measure Var(activation) at each layer under zeros, N(0,1), Xavier, He
3	The symmetry problem — train an MLP with zeros init; plot per-neuron weight evolution to show they never diverge
4	Xavier vs He on a ReLU MLP — 10-layer network on MNIST; plot loss curves, final test accuracy, activation histograms at each layer
5	Orthogonal RNN — initialize hidden-to-hidden weights with orthogonal_; verify W.T @ W ≈ I; compare gradient norms across timesteps with and without orthogonal init
6	PyTorch defaults audit — inspect the actual initialized weights of nn.Linear, nn.Conv2d, nn.Embedding, nn.LSTM, and nn.BatchNorm2d; verify they match the documented defaults

Previous Next →

Weight Initialization in PyTorch

Lab Overview

Sections

Privacy Policy

What we collect

What we don't collect

Your choices

Contact