ML data annotations made super easy for teams. Just upload data, add you...
Language Models Can See: Plugging Visual Controls in Text Generation
Automatic image captioning model based on Caffe, using features from bot...
Custom ComfyUI nodes for Vision Language Models, Large Language Models, ...
Image Captions Generation with Spatial and Channel-wise Attention
Computer vision and Deep learning
Official pytorch implementation of paper "Dual-Level Collaborative Trans...
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
This repository explores the variety of techniques and algorithms common...
Enriching MS-COCO with Chinese sentences and tags for cross-lingual mult...
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Ima...
[DEPRECATED] A Neural Network based generative model for captioning imag...
Deep Learning workshop including image classification, face recognition,...
Computer vision tools for fairseq, containing PyTorch implementation of ...
gis (go image server) go 实现的图片服务,实现基本的上传,下载,存储,按...