ms-swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs
tuner_backend parameter changed to peft. The interface of peft has been dynamically patched to support parameters like lora_dtype.
Ruozhiba dataset: https://github.com/modelscope/swift/blob/main/docs/source_en/LLM/Supported-models-datasets.md
train_dataset_mix_ds using custom_local_path by @Jintao-Huang in https://github.com/modelscope/swift/pull/582
Full Changelog: https://github.com/modelscope/swift/compare/v1.7.3...v2.0.0
swift export and update docs by @Jintao-Huang in https://github.com/modelscope/swift/pull/484
Full Changelog: https://github.com/modelscope/swift/compare/v1.6.0...v1.7.0
Full Changelog: https://github.com/modelscope/swift/compare/v1.6.0...v1.6.1
Full Changelog: https://github.com/modelscope/swift/compare/v1.5.4...v1.6.0
Full Changelog: https://github.com/modelscope/swift/compare/v1.5.3...v1.5.4
English Version
Chinese Version
Full Changelog: https://github.com/modelscope/swift/compare/v1.5.1...v1.5.2
Full Changelog: https://github.com/modelscope/swift/compare/v1.5.0...v1.5.1
New features:
swift web-ui
New tuners:
New models:
New datasets:
Datasets used in RLHF:
This month's new SWIFT release is out!
New features:
swift web-ui
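New tuners enabled:
SCEdit: a U-Net fine-tuning framework developed in-house by Tongyi Lab. It uses far less GPU memory than LoRA, gives better results than LoRA, and can stand in for ControlNet, enabling capabilities such as In-Painting, Out-Painting, label removal, and pose control.
New models:
SUS series models, Mixtral-MoE series models, deepseek series models, phi2-3b, cogagent-chat/cogagent-vqa, codegeex2-6b
New datasets:
Datasets used in RLHF: hh-rlhf, stack-exchange-paired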
Full Changelog: https://github.com/modelscope/swift/compare/v1.4.0...v1.5.0