RWKV Runner Versions Save

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

v1.8.1

6 days ago

Changes

Features

  • add support for dynamic state-tuned models

image

Upgrades

Improvements

  • add tps console output
  • add torch cnMirror
  • disable pre_ffn and head_qk
  • improve frontend details

Chores

  • update manifest.json and defaultModelConfigs

Install

v1.7.9

2 weeks ago

Changes

  • bump webgpu mode ai00_server v0.4.2 (huge performance improvement)
  • upgrade to rwkv 0.8.26 (state-tuned model support)
  • update defaultConfigs and manifest.json
  • chores

Breaking Changes

  • change the default value of presystem to false

For the convenience of using the future state-tuned models, the default value of presystem has been set to false. This means that the RWKV-Runner service will no longer automatically insert recommended RWKV pre-prompts for you:

User: hi

Assistant: Hi. I am your assistant and I will provide expert full response in full details. Please feel free to ask any question and I will always answer it.

If you are using the API service and conducting a rigorous RWKV conversation, please manually send the above messages to the /chat/completions API's messages array, or manually send presystem: true to have the server automatically insert pre-prompts.

If you are using the RWKV-Runner client for chatting, you can enable Insert default system prompt at the beginning in the preset editor.

Of course, in reality, even if you do not perform the above, there is usually no significant negative impact.

If you are using the new RWKV state-tuned models, you do not need to perform the above.

The new RWKV state-tuned models can be downloaded here, they are very interesting:

If you are interested in state-tuning, please refer to: https://github.com/BlinkDL/RWKV-LM#state-tuning-tuning-the-initial-state-zero-inference-overhead

Install

v1.7.6

1 month ago

Changes

Features

Improvements

  • improve preset editor
  • better compatibility for custom api (ollama etc.) image
  • throttling saveConfigs
  • improve error messages
  • other details

Install

v1.7.4

2 months ago

Changes

Features

Improvements

  • improve markdown rendering
  • improve theme
  • improve usability
  • for Chinese users, replace Tsinghua pip mirrors with Alibaba Cloud to avoid 403 http error

Install

v1.7.3

2 months ago

Changes

Features

  • add Docker support (#291) @LonghronShen

Fixes

  • fix a generation exception caused by potentially dangerous regex being passed into the stop array
  • fix max_tokens parameter of Chat page not being passed to backend
  • fix the issue where penalty_decay and global_penalty are not being passed to the backend default config when running the model through client

Improvements

  • prevent 'torch' has no attribute 'cuda' error in torch_gc, so user can use CPU or WebGPU (#302)

Chores

  • bump dependencies
  • add pre-release workflow
  • dep_check.py now ignores GPUtil

Install

v1.7.2

2 months ago