The code for the paper "Cost-Optimal Grouped-Query Attention for Long-Context Modeling" - View it on GitHub
Star
4
Rank
2666531