microsoft/RHELM - Gitstar Ranking

microsoft

Fetched on 2026/07/23 05:42

RHELM is a comprehensive benchmark for evaluating long-horizon memory capabilities in AI systems. Unlike existing benchmarks that focus on static dialogues, RHELM introduces realistic, heterogeneous, and evolving memory challenges that better reflect real-world assistant scenarios. - View it on GitHub

Star

Rank

1605123

microsoft

microsoft / RHELM