A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
Go to file
2023-12-13 15:23:34 +08:00
.github/workflows allow playing mid with external player 2023-12-12 22:13:09 +08:00
.vscode dev config 2023-06-05 22:57:01 +08:00
assets add Composition Page (RWKV-Music) 2023-07-28 12:30:05 +08:00
backend-golang fix windows cmd waiting 2023-12-12 23:48:32 +08:00
backend-python rwkv.cpp python38 compatibility 2023-12-12 23:19:18 +08:00
backend-rust/assets webgpu support 2023-08-16 23:07:58 +08:00
build update Readme_Install.txt 2023-12-13 15:23:34 +08:00
deploy-examples update setup comments 2023-11-17 20:47:33 +08:00
finetune lora finetune version check 2023-11-30 13:01:38 +08:00
frontend improve prompts 2023-12-12 23:27:19 +08:00
midi add midi api 2023-07-25 16:11:17 +08:00
.gitattributes rwkv.cpp(ggml) support 2023-12-12 20:29:55 +08:00
.gitignore allow playing mid with external player 2023-12-12 22:13:09 +08:00
CURRENT_CHANGE.md release v1.6.2 2023-12-12 23:50:56 +08:00
exportModelsJson.js update manifest.json 2023-05-07 16:09:16 +08:00
go.mod bump to wails v2.7.1 2023-12-10 23:43:40 +08:00
go.sum bump to wails v2.7.1 2023-12-10 23:43:40 +08:00
LICENSE navigate card 2023-05-05 13:41:54 +08:00
main.go add crash.log 2023-12-11 12:02:24 +08:00
Makefile fix lib 2023-11-29 22:59:42 +08:00
manifest.json release v1.6.2 2023-12-12 15:51:23 +00:00
README_JA.md update readme 2023-12-11 17:23:09 +08:00
README_ZH.md update readme 2023-12-11 17:23:09 +08:00
README.md update readme 2023-12-11 17:23:09 +08:00
vendor.yml rwkv.cpp(ggml) support 2023-12-12 20:29:55 +08:00
wails.json init 2023-05-03 23:38:54 +08:00

RWKV Runner

This project aims to eliminate the barriers of using large language models by automating everything for you. All you need is a lightweight executable program of just a few megabytes. Additionally, this project provides an interface compatible with the OpenAI API, which means that every ChatGPT client is an RWKV client.

license release

English | 简体中文 | 日本語

Install

Windows MacOS Linux

FAQs | Preview | Download | Simple Deploy Example | Server Deploy Examples | MIDI Hardware Input

Tip: You can deploy backend-python on a server and use this program as a client only. Fill in your server address in the Settings API URL.

Default configs has enabled custom CUDA kernel acceleration, which is much faster and consumes much less VRAM. If you encounter possible compatibility issues (output garbled), go to the Configs page and turn off Use Custom CUDA kernel to Accelerate, or try to upgrade your gpu driver.

If Windows Defender claims this is a virus, you can try downloading v1.3.7_win.zip and letting it update automatically to the latest version, or add it to the trusted list (Windows Security -> Virus & threat protection -> Manage settings -> Exclusions -> Add or remove exclusions -> Add an exclusion -> Folder -> RWKV-Runner).

For different tasks, adjusting API parameters can achieve better results. For example, for translation tasks, you can try setting Temperature to 1 and Top_P to 0.3.

Features

  • RWKV model management and one-click startup.
  • Front-end and back-end separation, if you don't want to use the client, also allows for separately deploying the front-end service, or the back-end inference service, or the back-end inference service with a WebUI. Simple Deploy Example | Server Deploy Examples
  • Compatible with the OpenAI API, making every ChatGPT client an RWKV client. After starting the model, open http://127.0.0.1:8000/docs to view more details.
  • Automatic dependency installation, requiring only a lightweight executable program.
  • Pre-set multi-level VRAM configs, works well on almost all computers. In Configs page, switch Strategy to WebGPU, it can also run on AMD, Intel, and other graphics cards.
  • User-friendly chat, completion, and composition interaction interface included. Also supports chat presets, attachment uploads, MIDI hardware input, and track editing. Preview | MIDI Hardware Input
  • Built-in WebUI option, one-click start of Web service, sharing your hardware resources.
  • Easy-to-understand and operate parameter configuration, along with various operation guidance prompts.
  • Built-in model conversion tool.
  • Built-in download management and remote model inspection.
  • Built-in one-click LoRA Finetune. (Windows Only)
  • Can also be used as an OpenAI ChatGPT and GPT-Playground client. (Fill in the API URL and API Key in Settings page)
  • Multilingual localization.
  • Theme switching.
  • Automatic updates.

Simple Deploy Example

git clone https://github.com/josStorer/RWKV-Runner

# Then
cd RWKV-Runner
python ./backend-python/main.py #The backend inference service has been started, request /switch-model API to load the model, refer to the API documentation: http://127.0.0.1:8000/docs

# Or
cd RWKV-Runner/frontend
npm ci
npm run build #Compile the frontend
cd ..
python ./backend-python/webui_server.py #Start the frontend service separately
# Or
python ./backend-python/main.py --webui #Start the frontend and backend service at the same time

# Help Info
python ./backend-python/main.py -h

API Concurrency Stress Testing

ab -p body.json -T application/json -c 20 -n 100 -l http://127.0.0.1:8000/chat/completions

body.json:

{
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ]
}

Embeddings API Example

Note: v1.4.0 has improved the quality of embeddings API. The generated results are not compatible with previous versions. If you are using embeddings API to generate knowledge bases or similar, please regenerate.

If you are using langchain, just use OpenAIEmbeddings(openai_api_base="http://127.0.0.1:8000", openai_api_key="sk-")

import numpy as np
import requests


def cosine_similarity(a, b):
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))


values = [
    "I am a girl",
    "我是个女孩",
    "私は女の子です",
    "广东人爱吃福建人",
    "我是个人类",
    "I am a human",
    "that dog is so cute",
    "私はねこむすめです、にゃん♪",
    "宇宙级特大事件!号外号外!"
]

embeddings = []
for v in values:
    r = requests.post("http://127.0.0.1:8000/embeddings", json={"input": v})
    embedding = r.json()["data"][0]["embedding"]
    embeddings.append(embedding)

compared_embedding = embeddings[0]

embeddings_cos_sim = [cosine_similarity(compared_embedding, e) for e in embeddings]

for i in np.argsort(embeddings_cos_sim)[::-1]:
    print(f"{embeddings_cos_sim[i]:.10f} - {values[i]}")

MIDI Input

Tip: You can download https://github.com/josStorer/sgm_plus and unzip it to the program's assets/sound-font directory to use it as an offline sound source. Please note that if you are compiling the program from source code, do not place it in the source code directory.

USB MIDI Connection

  • USB MIDI devices are plug-and-play, and you can select your input device in the Composition page
  • image

Mac MIDI Bluetooth Connection

  • For Mac users who want to use Bluetooth input, please install Bluetooth MIDI Connect, then click the tray icon to connect after launching, afterwards, you can select your input device in the Composition page.
  • image

Windows MIDI Bluetooth Connection

  • Windows seems to have implemented Bluetooth MIDI support only for UWP (Universal Windows Platform) apps. Therefore, it requires multiple steps to establish a connection. We need to create a local virtual MIDI device and then launch a UWP application. Through this UWP application, we will redirect Bluetooth MIDI input to the virtual MIDI device, and then this software will listen to the input from the virtual MIDI device.
  • So, first, you need to download loopMIDI to create a virtual MIDI device. Click the plus sign in the bottom left corner to create the device.
  • image
  • Next, you need to download Bluetooth LE Explorer to discover and connect to Bluetooth MIDI devices. Click "Start" to search for devices, and then click "Pair" to bind the MIDI device.
  • image
  • Finally, you need to install MIDIberry, This UWP application can redirect Bluetooth MIDI input to the virtual MIDI device. After launching it, double-click your actual Bluetooth MIDI device name in the input field, and in the output field, double-click the virtual MIDI device name we created earlier.
  • image
  • Now, you can select the virtual MIDI device as the input in the Composition page. Bluetooth LE Explorer no longer needs to run, and you can also close the loopMIDI window, it will run automatically in the background. Just keep MIDIberry open.
  • image

Preview

Homepage

image

Chat

image

image

Completion

image

Composition

Tip: You can download https://github.com/josStorer/sgm_plus and unzip it to the program's assets/sound-font directory to use it as an offline sound source. Please note that if you are compiling the program from source code, do not place it in the source code directory.

image

image

Configuration

image

Model Management

image

Download Management

image

LoRA Finetune

image

Settings

image