Adaptive Precision for EXpert Models: MoE-aware mixed-precision quantization - View it on GitHub
Star
301
Rank
126202