ml

Weight Initialization

Xavier Uniform/Normal, He Uniform/Normal, LeCun Normal, Random, Zeros, Ones — histogram of 20 samples vs theoretical PDF, per-activation recommendations.

Configuration

Formula
N(0, σ²), σ = √(2 / fan_in)
Computed: σ = 0.125
Sample mean
0.040579
theory: 0
Sample variance
0.022222
theory: 0.015625
w1w2w3w4w5w6w7w8w9w10
0.06870.21360.06670.0595-0.08470.11150.0456-0.16260.29060.2388
-0.00190.0524-0.09680.12260.24390.09920.0073-0.0740-0.0726-0.3161

Weight Distribution

-0.346-0.213-0.0790.0540.1870.3210.001.082.153.234.31SamplesTheoryDensityWeight value
Sample histogramTheoretical distribution

Method Comparison

MethodParameterNotes
Xavier Uniformlimit = 0.1768
Xavier Normalσ = 0.1021
He Uniformlimit = 0.2165✓ recommended
He Normalσ = 0.125✓ recommended
LeCun Normalσ = 0.08839
Random Normalσ = 0.01
Random Uniformlimit = 0.05
Zeros⚠ symmetry
Ones⚠ symmetry
Tip: Click any row to switch method. Recommended methods are highlighted for the selected activation. He Normal is generally preferred for ReLU networks; Xavier for saturating activations.