mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Open-source evaluation toolkit for large vision-language models (LVLMs), ...
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robus...