josc146
|
721653a812
|
fix the state cache crash caused by bad prompts
|
2023-06-15 22:37:00 +08:00 |
|
josc146
|
21c3009945
|
improve api docs
|
2023-06-15 21:52:22 +08:00 |
|
josc146
|
51c5696bb9
|
improve python dependencies installation
|
2023-06-14 22:21:17 +08:00 |
|
josc146
|
714b8834c7
|
chore
|
2023-06-13 22:47:17 +08:00 |
|
josc146
|
5896593951
|
max_trie_len
|
2023-06-12 15:22:17 +08:00 |
|
josc146
|
8431b5d24f
|
log Generation Prompt
|
2023-06-12 13:41:51 +08:00 |
|
josc146
|
bbd1ac1484
|
allow unloading model with switch-model
|
2023-06-12 12:34:03 +08:00 |
|
josc146
|
5990567a79
|
avoid misoperations of state_cache
|
2023-06-12 12:32:50 +08:00 |
|
josc146
|
fa0fcc2c89
|
add support for python3.8 3.9
|
2023-06-12 12:09:23 +08:00 |
|
josc146
|
cea1d8b4d1
|
add logs for state cache and switch-model
|
2023-06-09 20:46:19 +08:00 |
|
josc146
|
635767408f
|
fix UnboundLocalError: local variable 'response' referenced before assignment
|
2023-06-08 13:30:34 +08:00 |
|
josc146
|
9bd9b9ecbd
|
add requirements_without_cyac.txt
|
2023-06-05 22:58:56 +08:00 |
|
josc146
|
4e75531651
|
fix the crash issue caused by temperature being 0
|
2023-06-04 11:53:33 +08:00 |
|
josc146
|
edc6ac7297
|
chore
|
2023-06-03 20:34:33 +08:00 |
|
josc146
|
966b912013
|
improve logs
|
2023-06-03 19:28:37 +08:00 |
|
josc146
|
dc71054e61
|
improve logs
|
2023-06-03 17:36:50 +08:00 |
|
josc146
|
38b775c937
|
add logs
|
2023-06-03 17:12:59 +08:00 |
|
josc146
|
b41a2e7039
|
move state cache to memory (todo: state cache db)
|
2023-06-02 21:33:57 +08:00 |
|
josc146
|
b63370928d
|
macOS
|
2023-06-01 16:54:21 +08:00 |
|
josc146
|
2f5a7d2d51
|
fix_tokens
|
2023-05-31 16:07:09 +08:00 |
|
josc146
|
cf16e54463
|
fix_tokens
|
2023-05-31 14:55:13 +08:00 |
|
josc146
|
c8b2bb53ef
|
improve system for rwkv-4-world
|
2023-05-31 12:46:06 +08:00 |
|
josc146
|
8291c50058
|
safe ModelConfigBody
|
2023-05-30 23:13:27 +08:00 |
|
josc146
|
9945338458
|
chore
|
2023-05-30 11:52:33 +08:00 |
|
josc146
|
53b6a5ffe0
|
allow system to be placed anywhere
|
2023-05-29 22:26:22 +08:00 |
|
josc146
|
da033ab096
|
chore
|
2023-05-29 20:51:20 +08:00 |
|
josc146
|
142e30622e
|
send response even token is END_OF_TEXT
|
2023-05-29 20:17:29 +08:00 |
|
josc146
|
55bb33bcbb
|
embed all core dependencies
|
2023-05-29 20:14:42 +08:00 |
|
josc146
|
6fc5a335fb
|
embed dependencies
|
2023-05-29 09:39:16 +08:00 |
|
josc146
|
fecdf238c1
|
feat: preload preset_system
|
2023-05-29 00:08:13 +08:00 |
|
josc146
|
3e11128c9d
|
feat: use model state cache to achieve 5x - 50x faster preparation time for generation
|
2023-05-28 23:52:38 +08:00 |
|
josc146
|
94971bb666
|
support for rwkv-4-world
|
2023-05-28 12:53:14 +08:00 |
|
josc146
|
b7fb8ed898
|
improve api concurrency performance
|
2023-05-27 15:18:12 +08:00 |
|
josc146
|
06622b79aa
|
update rwkv_generate
|
2023-05-25 20:34:42 +08:00 |
|
josc146
|
bb8af451f6
|
fix cuda40 kernel
|
2023-05-25 00:22:09 +08:00 |
|
josc146
|
77ce87d209
|
update cuda40 kernel
|
2023-05-24 22:18:14 +08:00 |
|
josc146
|
f439b3d382
|
add api host setting
|
2023-05-24 22:03:30 +08:00 |
|
josc146
|
bcb38d991a
|
add role: "system" support
|
2023-05-24 14:01:22 +08:00 |
|
josc146
|
c741b2a203
|
fix api completion_lock (#6)
|
2023-05-24 11:45:55 +08:00 |
|
josc146
|
9a3657e6ea
|
delete cache before updating
|
2023-05-23 12:37:13 +08:00 |
|
josc146
|
1d08719645
|
update requirements and /status
|
2023-05-23 12:13:12 +08:00 |
|
josc146
|
524d9e78e6
|
SwitchModelBody.customCuda
|
2023-05-23 11:51:43 +08:00 |
|
josc146
|
7989e93afe
|
fixed torch version; CUDA acceleration utils
|
2023-05-23 11:19:39 +08:00 |
|
josc146
|
375af3bc1a
|
improve compatible API
|
2023-05-22 11:24:57 +08:00 |
|
josc146
|
85493da730
|
add compatible /v1/completions API
|
2023-05-22 11:18:37 +08:00 |
|
josc146
|
74ceffb32c
|
fix completion_text
|
2023-05-21 23:25:58 +08:00 |
|
josc146
|
c3084a3290
|
fix py lock
|
2023-05-21 13:46:54 +08:00 |
|
josc146
|
b8f7582513
|
chore & auto dep
|
2023-05-20 23:34:33 +08:00 |
|
josc146
|
9076ff3fd7
|
upload dep_check
|
2023-05-20 21:32:20 +08:00 |
|
josc146
|
82ea93ef3d
|
update
|
2023-05-20 17:07:27 +08:00 |
|