A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) - View it on GitHub
Star
5029
Rank
4214