SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs - View it on GitHub
Star
139
Rank
213596