Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
Stars: 687
Rank: 60489