Pocket-Sized Multimodal AI for content understanding and generation acro...
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Imag...
An official implementation for "CLIP4Clip: An Empirical Study of CLIP fo...
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-...
基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代...
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Ima...
React component for truncating multi-line spans and adding an ellipsis.
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenex...
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
ZMJImageEditor is a picture editing component like WeChat. It is powerfu...
Extract video features from raw videos using multiple GPUs. We support R...
CLIPort: What and Where Pathways for Robotic Manipulation
Open-source evaluation toolkit of large vision-language models (LVLMs), ...
Android Easy Reveal Library
中文CLIP预训练模型