VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language mo...
A neural network to generate captions for an image using CNN and RNN wit...