## Changes

**This version includes important bug fixes; upgrading is strongly recommended.**

### Upgrades

- webgpu 0.3.20 https://github.com/cgisky1980/ai00_rwkv_server

### Features

- allow setting quantizedLayers for WebGPU mode

### Improvements

- improve occurrence[token] condition
- disable AVOID_PENALTY_TOKENS when generating (still enabled when preprocessing)
- enable useHfMirror by default for Chinese users

### Fixes

- fix an issue where the state cache could be modified, leading to inconsistent hit results
- fix convert_safetensors.py for rwkv6
- add python3-dev to LoRA fine-tune dependencies (its absence could previously cause errors in v5 fine-tuning)

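The state cache fix listed above can be pictured as a defensive-copy problem: if a cache hands out a reference to its stored state and the caller then updates that state in place, later hits for the same prompt return different data. The sketch below is only an illustration under that assumption. `StateCache`, `put`, and `get` are hypothetical names and this is not RWKV-Runner's actual implementation; a plain list stands in for the model state.

```python
import copy


class StateCache:
    """Hypothetical prompt-keyed cache; not RWKV-Runner's real class."""

    def __init__(self):
        self._entries = {}

    def put(self, prompt, state):
        # Store a private copy so later changes made by the caller
        # cannot alter the cached entry.
        self._entries[prompt] = copy.deepcopy(state)

    def get(self, prompt):
        state = self._entries.get(prompt)
        # Return a copy so the caller can advance the state in place
        # without touching what the cache holds.
        return None if state is None else copy.deepcopy(state)


cache = StateCache()
cache.put("Hello", [0.1, 0.2])

first = cache.get("Hello")
first[0] = 9.9  # simulates an in-place update during generation

second = cache.get("Hello")  # still the originally cached values
```

Without the copies in `put` and `get`, the assignment to `first[0]` would also change the cached entry, so `second` would differ from what was originally stored.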
### Chores

- hide MPS and CUDA-Beta options
- update manifest

## Install

- Windows: https://github.com/josStorer/RWKV-Runner/blob/master/build/windows/Readme_Install.txt
- macOS: https://github.com/josStorer/RWKV-Runner/blob/master/build/darwin/Readme_Install.txt
- Linux: https://github.com/josStorer/RWKV-Runner/blob/master/build/linux/Readme_Install.txt
- Simple Deploy Example: https://github.com/josStorer/RWKV-Runner/blob/master/README.md#simple-deploy-example
- Server Deploy Examples: https://github.com/josStorer/RWKV-Runner/tree/master/deploy-examples