[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression> - View it on GitHub
Star
155
Rank
215699