Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch - View it on GitHub
Stars: 226
Rank: 142902