Pruning: Remove Useless Weights
Observation: Many weights in neural networks are close to zero.
Idea: Remove them!
Before pruning: [0.9, 0.01, -0.8, 0.001, 0.7]
After 40% pruning: [0.9, 0, -0.8, 0, 0.7]
Benefits:
- Smaller model
- Faster inference (fewer multiplications)