BitBLAS is a library for mixed-precision matrix multiplication, aimed especially at quantized LLM deployment.
Stars: 758 | Rank: 54818