Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
intel
Fetched on 2024/09/27 14:08
intel
/
auto-round
Advanced Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs" -
View it on GitHub
https://arxiv.org/abs/2309.05516
Star
222
Rank
132053