Quantize transformers to any learned arbitrary 4-bit numeric format - View it on GitHub
Star
41
Rank
543342