Using pretrained encoder and language models to generate captions from multimedia inputs. - View it on GitHub
Star
2
Rank
3760915