An open source implementation of CLIP.
Examples and tutorials on using SOTA computer vision models and techniqu...
Video Foundation Models & Data for Multimodal Understanding
Diffusion Classifier leverages pretrained diffusion models to perform ze...
Reproducible scaling laws for contrastive language-image learning (https...
PyTorch code for MUST
[TPAMI 2023] Generative Multi-Label Zero-Shot Learning
Alternate implementation for zero-shot text classification: Instead of r...