PyTorch implementation of the PEER (Parameter Efficient Expert Retrieval) block from the paper "Mixture of A Million Experts" by Xu Owen He at DeepMind
Stars: 115
Rank: 228588