awesome grounding: A curated list of research papers in visual grounding
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natur...
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transfor...
paper list of robotic grasping and some related works
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grou...
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Qu...
Referring Video Object Segmentation / Multi-Object Tracking Repo
[CVPR20] Video Object Grounding using Semantic Roles in Language Descrip...
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
[CVPR2021] Look before you leap: learning landmark features for one-stag...
PyTorch code for: Learning to Generate Grounded Visual Captions without ...
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense C...