Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch - View it on GitHub
Star
229
Rank
144265