Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training - View it on GitHub
Star
132
Rank
200953