microsoft/LLMLingua - Gitstar Ranking

microsoft

Fetched on 2026/02/08 04:14

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss. - View it on GitHub

https://llmlingua.com/

Star

5823

Rank

6236

microsoft

microsoft / LLMLingua