Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
Stars: 355
Rank: 90467