The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression".
Stars: 120
Rank: 235278