aws-samples/scalable-hw-agnostic-inference

aws-samples

Fetched on 2026/07/10 10:05

A hardware-agnostic (NVIDIA's GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT), generate text and text-to-image (e.g., Llama3 and Stable Diffusion ) on EKS controlled by K8s ingress in routing-time and Karpenter in scheduling-time that is scaled by KEDA. - View it on GitHub

Star

Rank

816967

aws-samples

aws-samples / scalable-hw-agnostic-inference