SteerViT is a framework that equips any ViT with the ability to steer both its global and local visual representations with natural language. - View it on GitHub
Star
2
Rank
4229555