hasura/agent-reliability-tool

hasura

Fetched on 2026/05/31 13:49

A tool for testing and understanding the reliability of LLM Agents. This tool evaluates agents on two key dimensions: 1. Visibility: How well the agent explains what it's doing 2. Repeatability: How consistent the agent's responses are - View it on GitHub

Star

Rank

14097786

hasura

hasura / agent-reliability-tool