Automatic image captioning model based on Caffe, using features from bottom-up attention.
No resources for this project.