This is the official implementation of Nearest Neighbor Speculative Decoding (https://arxiv.org/abs/2405.19325), an inference-time revision approach to enhance LLM factuality and generation attribution.