SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs - View it on GitHub
Star
96
Rank
278630