[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. - View it on GitHub
Star
750
Rank
48878