Fast inference from large lauguage models via speculative decoding - View it on GitHub
Star
0
Rank
13852712