PyTorch implementation of the PEER (Parameter Efficient Expert Retrieval) block from the paper "Mixture of A Million Experts" by Xu Owen He at Google DeepMind
Stars: 135
Rank: 229010