josc146
|
9ed3547738
|
rwkv pip 0.8.0
|
2023-06-28 19:36:15 +08:00 |
|
josc146
|
131a7ddf4a
|
fix the prompt cache that contains potential error
|
2023-06-21 16:07:16 +08:00 |
|
josc146
|
e93c77394d
|
add usage
|
2023-06-20 15:55:52 +08:00 |
|
josc146
|
8963543159
|
embeddings api compatible with openai api and langchain(sdk)
|
2023-06-19 22:51:06 +08:00 |
|
josc146
|
d32351c130
|
exact model name
|
2023-06-19 22:30:49 +08:00 |
|
josc146
|
967be6f88f
|
refactor completions api
|
2023-06-18 20:16:52 +08:00 |
|
josc146
|
21c3009945
|
improve api docs
|
2023-06-15 21:52:22 +08:00 |
|
josc146
|
8431b5d24f
|
log Generation Prompt
|
2023-06-12 13:41:51 +08:00 |
|
josc146
|
cea1d8b4d1
|
add logs for state cache and switch-model
|
2023-06-09 20:46:19 +08:00 |
|
josc146
|
4e75531651
|
fix the crash issue caused by temperature being 0
|
2023-06-04 11:53:33 +08:00 |
|
josc146
|
2f5a7d2d51
|
fix_tokens
|
2023-05-31 16:07:09 +08:00 |
|
josc146
|
cf16e54463
|
fix_tokens
|
2023-05-31 14:55:13 +08:00 |
|
josc146
|
c8b2bb53ef
|
improve system for rwkv-4-world
|
2023-05-31 12:46:06 +08:00 |
|
josc146
|
8291c50058
|
safe ModelConfigBody
|
2023-05-30 23:13:27 +08:00 |
|
josc146
|
da033ab096
|
chore
|
2023-05-29 20:51:20 +08:00 |
|
josc146
|
142e30622e
|
send response even token is END_OF_TEXT
|
2023-05-29 20:17:29 +08:00 |
|
josc146
|
fecdf238c1
|
feat: preload preset_system
|
2023-05-29 00:08:13 +08:00 |
|
josc146
|
3e11128c9d
|
feat: use model state cache to achieve 5x - 50x faster preparation time for generation
|
2023-05-28 23:52:38 +08:00 |
|
josc146
|
94971bb666
|
support for rwkv-4-world
|
2023-05-28 12:53:14 +08:00 |
|
josc146
|
06622b79aa
|
update rwkv_generate
|
2023-05-25 20:34:42 +08:00 |
|
josc146
|
524d9e78e6
|
SwitchModelBody.customCuda
|
2023-05-23 11:51:43 +08:00 |
|
josc146
|
7989e93afe
|
fixed torch version; CUDA acceleration utils
|
2023-05-23 11:19:39 +08:00 |
|
josc146
|
c947052574
|
preliminary usable features
|
2023-05-17 11:39:00 +08:00 |
|
josc146
|
83f0bb503c
|
update
|
2023-05-15 21:55:57 +08:00 |
|
josc146
|
0e852daf43
|
backend api
|
2023-05-07 17:27:54 +08:00 |
|