mfkiwl/distributed-llama - Gitstar Ranking

mfkiwl

Fetched on 2026/03/14 06:23

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed. - View it on GitHub

Star

Rank

13829210

mfkiwl

mfkiwl / distributed-llama