DFVG: A Heterogeneous Architecture for Speculative Decoding with Draft-on-FPGA and Verify-on-GPU.
Stars: 0
Rank: 13829210