BitBLAS is a library for mixed-precision matrix multiplication, aimed especially at quantized LLM deployment.
Stars: 405
Rank: 79114