"LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", Accepted to ACL 2024 - View it on GitHub
Star
1
Rank
5485387