release v1.7.2
This commit is contained in:
parent
2947162cc4
commit
14acfc1d81
@ -1,31 +1,17 @@
|
|||||||
## Changes
|
## Changes
|
||||||
|
|
||||||
**This version includes important bug fixes, it is strongly recommended to upgrade to this version.**
|
|
||||||
|
|
||||||
### Upgrades
|
|
||||||
|
|
||||||
- webgpu 0.3.20 https://github.com/cgisky1980/ai00_rwkv_server
|
|
||||||
|
|
||||||
### Features
|
### Features
|
||||||
|
|
||||||
- allow setting quantizedLayers of WebGPU mode
|
- allow setting tokenChunkSize of WebGPU mode
|
||||||
|
- expose global_penalty
|
||||||
|
|
||||||
### Improvements
|
### Improvements
|
||||||
|
|
||||||
- improve occurrence[token] condition
|
- improve parameters controllable range
|
||||||
- disable AVOID_PENALTY_TOKENS when generating (still enabled when preprocessing)
|
|
||||||
- enable useHfMirror by default for chinese users
|
|
||||||
|
|
||||||
### Fixes
|
|
||||||
|
|
||||||
- fix the issue where state cache could be modified leading to inconsistent hit results
|
|
||||||
- fix convert_safetensors.py for rwkv6
|
|
||||||
- add python3-dev to lora fine-tune dependencies (this may previously lead to the error of v5 fine-tune)
|
|
||||||
|
|
||||||
### Chores
|
### Chores
|
||||||
|
|
||||||
- hide MPS and CUDA-Beta Options
|
- update defaultModelConfigs
|
||||||
- update manifest
|
|
||||||
|
|
||||||
## Install
|
## Install
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user