# FinetuneLLMs (work in progress, under active development πŸ”₯)

Finetune an LLM within a few clicks!

## πŸ”₯ Goal & Roadmap πŸ”₯

The main objective of this project is to lower the barrier to training large language models, especially for startups that already have hardware on hand. With this project, it should be easy for a company to start experimenting with LLM training on its own servers with GPU/CPU resources.

More broadly, it gives anyone with spare hardware an easy way to put it to use in the AI field.

### Roadmap

#### Supported finetuning techniques

| Model \ Method | SFT | DPO | ORPO | KTO | PRO |
| -------------- | --- | --- | ---- | --- | --- |
| llama 2        | βœ…  | ❌  | ❌   | ❌  | ❌  |
| llama 3        | βœ…  | ❌  | βœ…   | ❌  | ❌  |
| gguf           | βœ…  | ❌  | ❌   | ❌  | ❌  |
| phi-3          | βœ…  | ❌  | ❌   | ❌  | ❌  |
| Mistral        | βœ…  | βœ…  | ❌   | ❌  | ❌  |
| ...            | ?   | ?   | ?    | ?   | ?   |
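
Of the columns above, SFT is the common denominator: plain supervised next-token training on a curated dataset. As a rough sketch of what that step amounts to, here is a minimal example using Hugging Face TRL; the model id, dataset, and step count are illustrative assumptions, not this project's actual trainer code.

```python
# Minimal SFT sketch with Hugging Face TRL. The model id, dataset, and
# hyperparameters below are illustrative assumptions, not FinetuneLLMs code.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# A tiny dataset slice keeps this runnable as a quick smoke test.
dataset = load_dataset("imdb", split="train[:1%]")

trainer = SFTTrainer(
    model="facebook/opt-350m",  # small, ungated model; swap in a llama checkpoint in practice
    train_dataset=dataset,
    args=SFTConfig(output_dir="./sft-out", max_steps=10),
)
trainer.train()
```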

## General Setup

This repo provides three modules: a frontend (React), a server (Node.js), and a trainer (Python/Django), which fit together as sketched below.
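
Presumably (the exact transport between the modules isn't documented here), a finetuning request flows through them like this:

```
frontend (React) ──► server (Node.js) ──► trainer (Python/Django)
     web UI           job management         runs finetuning jobs
```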

You need CUDA for now, but once llama.cpp is integrated, this will no longer be required.
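
A quick way to confirm that requirement is met before launching the trainer (this assumes the NVIDIA driver is installed and the trainer's environment has a CUDA-enabled PyTorch build):

```sh
# list visible GPUs plus driver/CUDA versions
nvidia-smi
# from the trainer's Python environment, check that PyTorch can see a GPU
python -c "import torch; print(torch.cuda.is_available())"
```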

## Dev Setup

### Set up the frontend

```sh
# copy .env.example to .env first
cd frontend
npm ci
npm run dev
# or: yarn && yarn dev
```

### Set up the server

```sh
# copy .env.example to .env and .env.development first
cd server
yarn                    # install dependencies so the Prisma client can be generated
npx prisma migrate dev  # applies migrations; reads DATABASE_URL from .env
yarn dev
```

### Set up the trainer

```sh
cd trainer
pip install -r requirements.txt
daphne trainer.asgi:application  # serve the Django app over ASGI
```
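
By default daphne binds to 127.0.0.1:8000. If the server module needs to reach the trainer on a different interface or port, daphne's standard bind and port flags apply (the values below are only examples):

```sh
# -b sets the bind address and -p the port; values here are examples
daphne -b 0.0.0.0 -p 8000 trainer.asgi:application
```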