A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) - View it on GitHub
Star
5509
Rank
5585