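Reduce operations are lossy: floating-point addition rounds every partial sum and is not associative, so the result depends on the precision and order of accumulation. This happens, for example, when gradients are averaged across multiple GPUs.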
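
A minimal sketch of the effect in plain Python, with no GPUs or communication library involved. The values in `per_gpu_grads` are hypothetical and deliberately extreme so the rounding is visible in 64-bit floats; `sequential_mean` is an illustrative helper, not the reduction algorithm of any particular framework.

```python
import math
from functools import reduce


def sequential_mean(values):
    """Average by adding left to right, like a simple chain reduce."""
    total = reduce(lambda acc, v: acc + v, values, 0.0)
    return total / len(values)


# Hypothetical gradient of one parameter as seen on four GPUs.
per_gpu_grads = [1e16, 1.0, -1e16, 1.0]

# Same four values, summed in two different orders.
order_a = sequential_mean(per_gpu_grads)
order_b = sequential_mean([1e16, -1e16, 1.0, 1.0])

# math.fsum tracks the rounding error and returns the correctly rounded sum.
exact = math.fsum(per_gpu_grads) / len(per_gpu_grads)

print(order_a, order_b, exact)
```

In the first order, `1.0` is absorbed when it is added to `1e16`, so part of the sum is lost before the large terms cancel. Real gradients rarely differ by this many orders of magnitude, but the same rounding is more pronounced when the reduction runs in a lower-precision format.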