microsoft/TaskTracker - Gitstar Ranking

microsoft

Fetched on 2025/07/24 20:55

TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a simple linear probe-based method and a more sophisticated metric learning method to achieve this. The project also releases the computationally expensive activation data to stimulate further AI safety research. - View it on GitHub

Star

Rank

406588

microsoft

microsoft / TaskTracker