awesome grounding: A curated list of research papers in visual grounding
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning an...
[CVPR20] Video Object Grounding using Semantic Roles in Language Descrip...
PyTorch Implementation of Consensus-based Sequence Training for Video Ca...
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://...
A simple yet interesting tool for chatting with video
PyTorch code for: Learning to Generate Grounded Visual Captions without ...
Video to Language Challenge (MSR-VTT Challenge 2016)