Clear Gradients in Optimizer
Setting gradients to None is better than the default `zero_grad()` behavior: instead of writing zeros into every parameter's gradient buffer, the gradient tensors are simply released, and the next backward pass assigns fresh gradients rather than accumulating into zero-filled memory. This cuts unnecessary memory operations and makes the backward pass more efficient. Use it when you want to reduce memory overhead and speed up training; the gradient values themselves are computed exactly as before.
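A minimal sketch of a training step, assuming a PyTorch model and optimizer are already set up; the model, data, and hyperparameters here are placeholders. The `set_to_none=True` flag of `optimizer.zero_grad()` (the default in recent PyTorch releases) drops the gradient tensors instead of zeroing them in place:

```python
import torch
import torch.nn as nn

# Toy model, data, and optimizer purely for illustration.
model = nn.Linear(128, 10)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()
inputs = torch.randn(32, 128)
targets = torch.randint(0, 10, (32,))

for step in range(10):
    # Clear gradients by setting them to None rather than filling them with zeros.
    # The next backward() then assigns new gradient tensors instead of
    # accumulating into zeroed buffers, avoiding the extra memory writes.
    optimizer.zero_grad(set_to_none=True)

    loss = loss_fn(model(inputs), targets)
    loss.backward()
    optimizer.step()
```

The same effect can be had by looping over `model.parameters()` and assigning `p.grad = None` manually; using the optimizer's flag just keeps the training loop tidy.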