mfkiwl/matmul-cpu - Gitstar Ranking

mfkiwl

Fetched on 2026/06/23 02:18

High-performance CPU GEMM kernels (C = A·Bᵀ + C) optimized for LLM inference, featuring AVX2/AVX-512 SIMD and multi-threading. Benchmarked against OpenBLAS. - View it on GitHub

https://codebearjourney.top/hpc/cpu/matmul

Star

Rank

14050827

mfkiwl

mfkiwl / matmul-cpu