release v1.7.2

josc146 2024-03-02 19:47:53 +08:00
parent 2947162cc4
commit 14acfc1d81

@@ -1,31 +1,17 @@
## Changes
**This version includes important bug fixes; upgrading to it is strongly recommended.**
### Upgrades
- webgpu 0.3.20 https://github.com/cgisky1980/ai00_rwkv_server
### Features
- allow setting quantizedLayers of WebGPU mode - allow setting tokenChunkSize of WebGPU mode
- expose global_penalty
### Improvements
- improve occurrence[token] condition - improve parameters controllable range
- disable AVOID_PENALTY_TOKENS when generating (still enabled when preprocessing)
- enable useHfMirror by default for Chinese users
### Fixes
- fix the issue where state cache could be modified leading to inconsistent hit results
- fix convert_safetensors.py for rwkv6
- add python3-dev to lora fine-tune dependencies (its absence may previously have caused errors in v5 fine-tuning)
### Chores
- hide MPS and CUDA-Beta Options - update defaultModelConfigs
- update manifest
## Install