Bare-metal FPGA implementation of the pccx NPU for LLM inference on Kria KV260: SystemVerilog RTL, W4A8 quantization, GEMM/GEMV datapaths, KV-cache scheduling, and driver code. - View it on GitHub
Star
0
Rank
14047817