Learning phrase grounding from captioned images through InfoNCE bound on mutual information - View it on GitHub
Star
0
Rank
11399557