Add support for quantization int4 for faster inference. - View it on GitHub
Star
13
Rank
1173505