Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by DeepMind.
Stars: 0
Rank: 13837481
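
For orientation, the accept/reject rule at the core of speculative sampling can be sketched as below. This is a minimal illustration under stated assumptions, not the code from this repository: the names `speculative_step` and `sample_index` are hypothetical, probabilities are plain Python lists over a small vocabulary, and the draft and target distributions are assumed to have been computed already by the two models.

```python
import random


def sample_index(weights, rng=random):
    """Sample an index in proportion to non-negative (unnormalized) weights."""
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


def speculative_step(draft_tokens, draft_probs, target_probs, rng=random):
    """One round of speculative sampling (hypothetical helper, for illustration).

    draft_tokens: K token ids, each sampled from the draft model.
    draft_probs:  K distributions q_i (lists of floats) the tokens were drawn from.
    target_probs: K + 1 distributions p_i from the target model; the last one
                  is used only if every draft token is accepted.
    Returns the tokens emitted in this round (between 1 and K + 1 tokens).
    """
    emitted = []
    for i, token in enumerate(draft_tokens):
        q = draft_probs[i]
        p = target_probs[i]
        # Accept the draft token with probability min(1, p(x) / q(x)).
        # q[token] > 0 holds because the token was sampled from q.
        if rng.random() < min(1.0, p[token] / q[token]):
            emitted.append(token)
            continue
        # On rejection, resample from the residual max(0, p - q) and stop.
        # The residual has positive mass here because p[token] < q[token].
        residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
        emitted.append(sample_index(residual, rng))
        return emitted
    # Every draft token was accepted: draw one extra token from the target.
    emitted.append(sample_index(target_probs[len(draft_tokens)], rng))
    return emitted
```

When the draft and target distributions agree, every draft token is accepted and the round yields K + 1 tokens. The more they disagree, the earlier a rejection tends to end the round, and the residual resampling step is what keeps the emitted tokens distributed according to the target model.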