Adding Encoder-decoder attention cache. https://github.com/tensorflow/tensor2tensor/pull/827