paolosalvatori/shared-azure-openai-tpm

paolosalvatori

Fetched on 2026/06/23 01:24

This example shows how a multitenant service can distribute requests evenly among multiple Azure OpenAI Service instances and manage tokens per minute (TPM) for multiple tenants.
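The description names two mechanisms: spreading requests evenly across several Azure OpenAI Service instances, and enforcing a tokens-per-minute budget per tenant. The repository's actual implementation is not shown on this page, so the following is only a minimal sketch of one way the two ideas could fit together. It assumes round-robin rotation for the "even" distribution and a sliding 60-second window for the TPM budget. All names here (`TenantTpmLimiter`, `RoundRobinRouter`, `route`, the endpoint strings) are hypothetical and not taken from the repository.

```python
import itertools
import time
from collections import defaultdict, deque


class TenantTpmLimiter:
    """Tracks token usage per tenant over a sliding window (assumed: 60 s)."""

    def __init__(self, tpm_limits, window_seconds=60.0, clock=time.monotonic):
        self.tpm_limits = dict(tpm_limits)   # tenant -> allowed tokens per window
        self.window_seconds = window_seconds
        self.clock = clock                   # injectable so the logic can be tested
        self.usage = defaultdict(deque)      # tenant -> deque of (timestamp, tokens)

    def try_consume(self, tenant, tokens):
        """Record the tokens and return True if the tenant stays within budget."""
        now = self.clock()
        entries = self.usage[tenant]
        # Drop usage records that have aged out of the window.
        while entries and now - entries[0][0] >= self.window_seconds:
            entries.popleft()
        used = sum(count for _, count in entries)
        if used + tokens > self.tpm_limits[tenant]:
            return False
        entries.append((now, tokens))
        return True


class RoundRobinRouter:
    """Hands out endpoints in a fixed rotation (an assumed balancing policy)."""

    def __init__(self, endpoints):
        self._cycle = itertools.cycle(list(endpoints))

    def next_endpoint(self):
        return next(self._cycle)


def route(limiter, router, tenant, tokens):
    """Return the endpoint to call, or None if the tenant is over its budget."""
    if not limiter.try_consume(tenant, tokens):
        return None  # a real service would likely answer with HTTP 429 here
    return router.next_endpoint()
```

In this sketch a rejected request does not advance the rotation, so throttled tenants do not skew the distribution across instances. A production service would also need shared state across replicas (for example an external cache) and an estimate of token counts before the call, neither of which is modelled here.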

