Video Foundation Models & Data for Multimodal Understanding
Papers, code and datasets about deep learning and multi-modal learning f...
500,000 multimodal short videos and baseline models. ...
Tools for loading video datasets and applying transforms to video in PyTorch. You ...
:seedling: Starter kit for working with the EPIC-KITCHENS-55 dataset for...
SoccerAct10 is a dataset which contains 10 different soccer actions. Thi...