Towards Video Text Visual Question Answering: Benchmark and Baseline
Stars: 40
Rank: 567273