Towards Video Text Visual Question Answering: Benchmark and Baseline - View it on GitHub
Star
37
Rank
524570