Getting the latest versions of Disco Diffusion to work locally, instead ...
AutoCut Client
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabula...
Official Pytorch implementation of "CLIPstyler:Image Style Transfer with...
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,M...
Language Models Can See: Plugging Visual Controls in Text Generation
GenSim: Generating Robotic Simulation Tasks via Large Language Models
Paddle Multimodal Integration and eXploration, supporting mainstream mul...
Give a custom shape to any flutter widget, Material Design 2 ready
A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and Op...
Search photos on Unsplash based on OpenAI's CLIP model, support search w...
CLIP (Contrastive Language–Image Pre-training) for Italian
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrar...
It is a simple library to speed up CLIP inference up to 3x (K80 GPU)
[CVPR 2023 Workshop] VAND Challenge: 1st Place on Zero-shot AD and 4th P...