AMD PACE is a PyTorch extension for high-performance LLM inference on AMD CPUs. It provides a framework to develop and test novel optimizations, accelerating real-time deployment.
Stars: 9
Rank: 1682825