Official repository of paper titled "Learning to Prompt with Text Only S...
Awesome Multimodal Assistant is a curated list of multimodal chatbots/co...
A general representation modal across vision, audio, language modalities.
The repository of ECCV 2020 paper `Active Visual Information Gathering f...
Authors official PyTorch implementation of the "ContraCLIP: Interpretabl...
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
The released data for paper "Measuring and Improving Chain-of-Thought Re...