SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs - View it on GitHub
Star
176
Rank
181888