Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch
Stars: 225
Rank: 133235