Code base for "Sim-LLM: Efficient Inference Optimization Based on Task Similarity for Large Language Models in Edge" - View it on GitHub
Star
2
Rank
3970269