Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training - View it on GitHub
Star
141
Rank
221011