Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks
LLaVA_XTuner
models by @LZHgrla in https://github.com/open-compass/VLMEvalKit/pull/17
LLaVA-InternLM2
by @LZHgrla in https://github.com/open-compass/VLMEvalKit/pull/53
Full Changelog: https://github.com/open-compass/VLMEvalKit/commits/v0.1