transformer no longer returns unnecessary attention weights. fix: allow backward when training ingredient decoder
3ab629c, amaiasalvador committed on Jul 3, 2019