Unify Efficient Fine-Tuning of 100+ LLMs
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Langu...
Firefly: 大模型训练工具,支持训练Llama3、Gemma、MiniCPM、Yi、Deepseek、O...
Instruction Tuning with GPT-4
Aligning pretrained language models with instruction data generated by t...
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
We unified the interfaces of instruction-tuning data (e.g., CoT data), m...
Video-LLaVA: Learning United Visual Representation by Alignment Before P...
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
A one-stop data processing system to make data higher-quality, juicier, ...
An Open-sourced Knowledgable Large Language Model Framework.
A collection of open-source dataset to train instruction-following LLMs ...
Video Foundation Models & Data for Multimodal Understanding
Crosslingual Generalization through Multitask Finetuning