FF's Notes
← Home

Memory Optimization Tricks

Aug 12, 2025

Save tensors with 16 bit

Gradient checkpointing

Saves memory by recomputing intermediate activations during backprop instead of storing them

Quantize model