SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs - View it on GitHub
Star
200
Rank
175848