Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)
This repository is the PyTorch implementation of the paper:
Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)
Shweta Mahajan, Iryna Gurevych, Stefan Roth
This code has been developed under Python 3.5, PyTorch 1.0.0 and CUDA 10.0.
Run requirements.py to check if all required packages are installed. The script train.py is used for training. The parameters are listed in params.json. Note that there are two different configurations for best performance on the image captioning and text-to-image synthesis tasks.
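For readers who want to see what a package check of this kind can look like, the following is a minimal sketch and not the contents of requirements.py. The helper name `missing_packages` and the `required` list are hypothetical; only `torch` is taken from the environment stated above, and the real script may check a different set of packages.

```python
import importlib.util


def missing_packages(names):
    """Return the entries of `names` that are not importable top-level modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumption: this list is illustrative only; the repository's own
    # requirements.py defines the actual set of required packages.
    required = ["torch"]
    missing = missing_packages(required)
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All listed packages are installed.")
```

The check only looks up whether a module can be located; it does not import the module or verify its version.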
Example usage to train a model on COCO 2014 for captioning:
python train.py --config params_i2t
Example usage to train a model on COCO 2014 for the text-to-image synthesis task:
python train.py --config params_t2i
Note that training requires CUDA 10.0 and GPU devices. The number of GPUs used can be set in params.json. Also note that we use 1 Nvidia Volta V100 GPU and 3 Nvidia Volta V100 GPUs with 32GB for the captioning and text-to-image synthesis tasks, respectively.
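Since training needs GPUs, it can help to confirm how many devices PyTorch sees before launching train.py. The sketch below is not part of this repository; the function names `visible_gpu_count` and `enough_gpus` are hypothetical, and the value 3 is simply the GPU count reported above for the text-to-image synthesis task.

```python
def visible_gpu_count():
    """Return the number of CUDA devices PyTorch can see (0 if PyTorch is not installed)."""
    try:
        import torch
    except ImportError:
        return 0
    return torch.cuda.device_count()


def enough_gpus(required, available=None):
    """Check whether at least `required` GPUs are available."""
    if available is None:
        available = visible_gpu_count()
    return available >= required


if __name__ == "__main__":
    # Assumption: 3 matches the setup reported for text-to-image synthesis;
    # use the value configured for your own run instead.
    needed = 3
    found = visible_gpu_count()
    print("GPUs visible: {}, needed: {}, ok: {}".format(
        found, needed, enough_gpus(needed, found)))
```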
For evaluation we use the following repos,
Checkpoints are available for text-to-image synthesis and for image captioning.
@inproceedings{mahajan2020latent,
title = {Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings},
author = {Mahajan, Shweta and Gurevych, Iryna and Roth, Stefan},
booktitle = {International Conference on Learning Representations},
year = {2020},
}