ModelScope Swift Versions

ms-swift: Use PEFT or full-parameter training to fine-tune 200+ LLMs or 15+ MLLMs

v2.0.4

2 weeks ago

v2.0.3

3 weeks ago

v2.0.0

1 month ago

New Features

  1. Support for peft 0.10.x, with the default value of the tuner_backend parameter changed to peft. The peft interface is dynamically patched to support additional parameters such as lora_dtype (see the sketch after this list).
  2. Support for vLLM + LoRA inference.
  3. Refactored and updated the README.
  4. Added English versions of the documentation. All documents now have both English and Chinese versions.
  5. Support for training 70B models with FSDP + QLoRA on two 24GB GPUs. Script available at: https://github.com/modelscope/swift/blob/main/examples/pytorch/llm/scripts/llama2_70b_chat/qlora_fsdp/sft.sh
  6. Support for agent training and the ModelScopeAgent framework. Documentation available at: https://github.com/modelscope/swift/blob/main/docs/source/LLM/Agent%E5%BE%AE%E8%B0%83%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.md
  7. Support for model evaluation and benchmarking. Documentation available at: https://github.com/modelscope/swift/blob/main/docs/source/LLM/LLM%E8%AF%84%E6%B5%8B%E6%96%87%E6%A1%A3.md
  8. Support for multi-task experiment management. Documentation available at: https://github.com/modelscope/swift/blob/main/docs/source/LLM/LLM%E5%AE%9E%E9%AA%8C%E6%96%87%E6%A1%A3.md
  9. Support for GaLore training.
  10. Support for training and inference of AQLM- and AWQ-quantized models.
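
The notes above name the tuner_backend and lora_dtype parameters and GaLore support. The sketch below shows how they might appear in a fine-tuning command; it is a minimal sketch, and the model/dataset values and the use_galore flag name are assumptions rather than values taken from these notes, so confirm the exact spellings with swift sft --help.

```bash
# Sketch: LoRA fine-tune on the peft backend with an explicit lora_dtype.
# model_type/dataset are placeholders; verify flag names with `swift sft --help`.
CUDA_VISIBLE_DEVICES=0 swift sft \
    --model_type qwen1half-7b-chat \
    --dataset alpaca-zh \
    --sft_type lora \
    --tuner_backend peft \
    --lora_dtype AUTO

# Sketch: full-parameter fine-tune with GaLore enabled.
# The use_galore flag name is an assumption.
CUDA_VISIBLE_DEVICES=0 swift sft \
    --model_type qwen1half-7b-chat \
    --dataset alpaca-zh \
    --sft_type full \
    --use_galore true
```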

New Models

New Datasets

What's Changed

New Contributors

Full Changelog: https://github.com/modelscope/swift/compare/v1.7.3...v2.0.0

v1.7.0

2 months ago

New Features:

  1. Added support for swift export, enabling AWQ int4 quantization and GPTQ int2/3/4/8 quantization. Quantized models can be pushed to the ModelScope Hub; see the documentation and the sketch after this list.
  2. Enabled fine-tuning of AWQ-quantized models.
  3. Enabled fine-tuning of AQLM-quantized models.
  4. Added support for deploying LLMs with infer_backend='pt'.
  5. Added a web UI with task management and visualization of training loss, evaluation loss, etc. Inference is accelerated with vLLM.
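
A hedged sketch of the swift export flow from item 1: quantize a model to AWQ int4 with a calibration dataset and push the result to the ModelScope Hub. The quant_method, quant_bits, push_to_hub, hub_model_id, and hub_token flag names and the example model are assumptions, not taken from these notes; confirm them with swift export --help. The pileval dataset (listed under New Datasets below) is a common AWQ calibration set.

```bash
# Sketch: AWQ int4 quantization via swift export, then push to the ModelScope Hub.
# Flag names and the model/dataset values are assumptions; check `swift export --help`.
CUDA_VISIBLE_DEVICES=0 swift export \
    --model_type qwen1half-7b-chat \
    --quant_method awq \
    --quant_bits 4 \
    --dataset pileval \
    --push_to_hub true \
    --hub_model_id my-org/qwen1half-7b-chat-awq-int4 \
    --hub_token '<modelscope-token>'
```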

New Tuners:

  1. LoRA+ (a hedged enabling sketch follows this list).
  2. LlamaPro.
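
A hedged sketch of how these tuners might be switched on. The lora_lr_ratio flag (LoRA+ is usually exposed as a learning-rate multiplier for the LoRA B matrices) and the llamapro value for sft_type are assumptions, not taken from these notes; check the tuner documentation for the exact parameter names.

```bash
# Sketch: LoRA+ -- a regular LoRA run with a larger learning rate on the B matrices.
# The lora_lr_ratio flag name is an assumption.
swift sft --model_type qwen1half-7b-chat --dataset ms-bench-mini \
    --sft_type lora --lora_lr_ratio 16.0

# Sketch: LlamaPro -- expand the model with extra transformer blocks and train only those.
# The llamapro value for --sft_type is an assumption.
swift sft --model_type qwen1half-7b-chat --dataset ms-bench-mini \
    --sft_type llamapro
```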

New Models:

  1. qwen1.5 awq series.
  2. gemma series.
  3. yi-9b.
  4. deepseek-math series.
  5. internlm2-1_8b series.
  6. openbuddy-mixtral-moe-7b-chat.
  7. llama2 aqlm series.

New Datasets:

  1. ms-bench-mini.
  2. hh-rlhf-cn series.
  3. disc-law-sft-zh, disc-med-sft-zh.
  4. pileval.

What's Changed

Full Changelog: https://github.com/modelscope/swift/compare/v1.6.0...v1.7.0

v1.6.1

2 months ago

New Models:

  1. deepseek-math series

New Datasets:

  1. sharegpt-gpt4-mini
  2. disc-law-sft-zh
  3. disc-med-sft-zh

Bug Fixes

  1. Fixed a bug with vllm==0.3 and swift deploy.
  2. Fixed a bug with DeepSpeed ZeRO-3 and swift LoRA.

Full Changelog: https://github.com/modelscope/swift/compare/v1.6.0...v1.6.1

v1.6.0

3 months ago

New Features:

  1. Agent Training
  2. AIGC support: controlnet, controlnet_sdxl, dreambooth, text_to_image, text_to_image_sdxl
  3. Compatibility with vllm==0.3.*

New Models:

  1. qwen1.5 series
  2. openbmb series

What's Changed

Full Changelog: https://github.com/modelscope/swift/compare/v1.5.4...v1.6.0

v1.5.4

3 months ago

New Features:

  1. Shipped a default zero3.json DeepSpeed config file (see the sketch after this list)
  2. Enhanced support for multi-modal models
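
A hedged sketch of using the bundled ZeRO-3 configuration. The default-zero3 shorthand for --deepspeed and the NPROC_PER_NODE launcher variable are assumptions based on how the example scripts in this repository are usually written; passing the path to the shipped zero3.json directly should be equivalent.

```bash
# Sketch: two-GPU LoRA fine-tune using the bundled DeepSpeed ZeRO-3 config.
# The `default-zero3` shorthand and NPROC_PER_NODE are assumptions; a path to zero3.json also works.
NPROC_PER_NODE=2 CUDA_VISIBLE_DEVICES=0,1 swift sft \
    --model_type qwen-7b-chat \
    --dataset alpaca-zh \
    --sft_type lora \
    --deepspeed default-zero3
```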

New Models:

  1. Orion series
  2. Codefuse series
  3. Internlm2-math series
  4. Internlm2 series
  5. Qwen2 series
  6. Yi-vl series
  7. Internlm-xcomposer2

What's Changed

New Contributors

Full Changelog: https://github.com/modelscope/swift/compare/v1.5.3...v1.5.4

v1.5.2

4 months ago

New Features

  1. Support showing logs in a text box in the web UI
  2. Support share=True in the web UI; just set WEBUI_SHARE=1 as an environment variable (see the sketch after this list)
  3. Support deactivating all adapters
  4. Support more SFT arguments
  5. Added longlora/qalora scripts
  6. Support user-registered custom models in the web UI
  7. ModelScope SWIFT studio released: https://www.modelscope.cn/studios/damo/Scalable-lightWeight-Infrastructure-for-Fine-Tuning/summary
  8. Fixed some bugs
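
Item 2 only requires the WEBUI_SHARE environment variable, so launching a publicly shareable web UI reduces to a one-liner:

```bash
# Launch the training/inference web UI with share=True (public share link) enabled.
WEBUI_SHARE=1 swift web-ui
```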


What's Changed

Full Changelog: https://github.com/modelscope/swift/compare/v1.5.1...v1.5.2

v1.5.1

4 months ago

New Features

  1. Support dtype settings in LoRA
  2. Support offloading deactivated tuners to CPU and meta devices
  3. Support deployment with an OpenAI-format RESTful API (see the sketch after this list)
  4. Made LongLoRA support the latest llama2 code
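
A hedged sketch of item 3. The swift deploy entrypoint and the default port 8000 are assumptions carried over from later releases and may differ here; because the API follows the OpenAI format, a plain curl call (or any OpenAI-style client) can consume it.

```bash
# Sketch: serve a model behind an OpenAI-format REST API, then query it with curl.
# The `swift deploy` command and port 8000 are assumptions for this release.
swift deploy --model_type qwen-7b-chat &

curl http://127.0.0.1:8000/v1/chat/completions \
    -H 'Content-Type: application/json' \
    -d '{"model": "qwen-7b-chat", "messages": [{"role": "user", "content": "Hello!"}]}'
```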


What's Changed

Full Changelog: https://github.com/modelscope/swift/compare/v1.5.0...v1.5.1

v1.5.0

4 months ago

New features:

  1. Support multi-line input during inference
  2. Support multi-node training
  3. Added training benchmarks
  4. Support UI-based training and inference, started via swift web-ui
  5. Support vLLM inference (see the sketch after this list)
  6. Support RLHF (DPO) training
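
A hedged sketch of the new vLLM-accelerated inference path in item 5. The infer_backend flag is inferred from the later v1.7.0 notes (infer_backend='pt'), and the model name is a placeholder; confirm with swift infer --help.

```bash
# Sketch: interactive inference accelerated by vLLM.
# The infer_backend flag and the model_type value are assumptions; verify with `swift infer --help`.
CUDA_VISIBLE_DEVICES=0 swift infer \
    --model_type qwen-7b-chat \
    --infer_backend vllm
```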

New tuners:

  1. SCEdit, developed by TongYi Lab, uses less memory than LoRA while delivering better results, and can replace ControlNet in scenarios such as pose control, in-painting, out-painting, and label removal.

New models:

  1. SUS series models
  2. Mixtral-MoE series models
  3. deepseek series models
  4. phi2-3b
  5. cogagent-chat/cogagent-vqa
  6. codegeex2-6b

New datasets:

Datasets used in RLHF:

  1. hh-rlhf
  2. stack-exchange-paired


What's Changed

Full Changelog: https://github.com/modelscope/swift/compare/v1.4.0...v1.5.0