Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
bytedance
Fetched on 2025/04/15 20:31
bytedance
/
FlexPrefill
Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference -
View it on GitHub
https://arxiv.org/abs/2502.20766
Star
82
Rank
315858