A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) - View it on GitHub
Star
5121
Rank
4240