A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) - View it on GitHub
Star
5452
Rank
5441