thunlp/hybrid-linear-attention

Fetched on 2026/05/31 12:34

Code and models for the paper "Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts".

Rank: 661413
