Bottom-up attention model for image captioning and VQA, based on Faster ...
awesome grounding: A curated list of research papers in visual grounding
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Show, Control and Tell: A Framework for Generating Controllable and Grou...
A neural network to generate captions for an image using CNN and RNN wit...
A modular library built on top of Keras and TensorFlow to generate a cap...
Automatic image captioning model based on Caffe, using features from bot...
:camera_flash: Generates hashtags for Instagram posts. Upload your photo...
This repository contains my solutions to the assignments for Stanford's ...
[DEPRECATED] A Neural Network based generative model for captioning imag...
Implementation of Diverse and Accurate Image Description Using a Variati...
Adds SPICE metric to coco-caption evaluation server codes
EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Im...
PyTorch code for: Learning to Generate Grounded Visual Captions without ...
Deep CNN-LSTM for Generating Image Descriptions :smiling_imp: