Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome