PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECC...
[CVPR 2023 Workshop] VAND Challenge: 1st Place on Zero-shot AD and 4th P...
Connecting segment-anything's output masks with the CLIP model; Awesome-...
Teaching robots to respond to open-vocab queries with CLIP and NeRF-like...
Reproducible scaling laws for contrastive language-image learning (https...
[ICCV 2023] Prompt-aligned Gradient for Prompt Tuning
CLIP implementation for Russian language
Everything you need to know about Transformers! 🤖
Python package to generate image embeddings with CLIP without PyTorch/Te...
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Dis...
MARCELO: an AI powered bot to automate the editing and thumbnail creatio...
Famous Vision Language Models and Their Architectures
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic r...
Yet Another Stable Diffusion Discord Bot
Pytorch code for Language Models with Image Descriptors are Strong Few-S...