Low-bit LLM inference on CPU/NPU with lookup table - View it on GitHub
Star
901
Rank
43201