A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
The following new datasets were added:
Note: The API has many breaking changes relative to v0.1. Refer to the documentation to learn how to work with Pythia v0.3.