SpargeAttention: A training-free sparse attention that can accelerate any model inference. - View it on GitHub
Star
1
Rank
5919397