Commit Graph

45 Commits

Author SHA1 Message Date
josc146
b41a2e7039 move state cache to memory (todo: state cache db) 2023-06-02 21:33:57 +08:00
josc146
b63370928d macOS 2023-06-01 16:54:21 +08:00
josc146
2f5a7d2d51 fix_tokens 2023-05-31 16:07:09 +08:00
josc146
cf16e54463 fix_tokens 2023-05-31 14:55:13 +08:00
josc146
c8b2bb53ef improve system for rwkv-4-world 2023-05-31 12:46:06 +08:00
josc146
8291c50058 safe ModelConfigBody 2023-05-30 23:13:27 +08:00
josc146
9945338458 chore 2023-05-30 11:52:33 +08:00
josc146
53b6a5ffe0 allow system to be placed anywhere 2023-05-29 22:26:22 +08:00
josc146
da033ab096 chore 2023-05-29 20:51:20 +08:00
josc146
142e30622e send response even token is END_OF_TEXT 2023-05-29 20:17:29 +08:00
josc146
55bb33bcbb embed all core dependencies 2023-05-29 20:14:42 +08:00
josc146
6fc5a335fb embed dependencies 2023-05-29 09:39:16 +08:00
josc146
fecdf238c1 feat: preload preset_system 2023-05-29 00:08:13 +08:00
josc146
3e11128c9d feat: use model state cache to achieve 5x - 50x faster preparation time for generation 2023-05-28 23:52:38 +08:00
josc146
94971bb666 support for rwkv-4-world 2023-05-28 12:53:14 +08:00
josc146
b7fb8ed898 improve api concurrency performance 2023-05-27 15:18:12 +08:00
josc146
06622b79aa update rwkv_generate 2023-05-25 20:34:42 +08:00
josc146
bb8af451f6 fix cuda40 kernel 2023-05-25 00:22:09 +08:00
josc146
77ce87d209 update cuda40 kernel 2023-05-24 22:18:14 +08:00
josc146
f439b3d382 add api host setting 2023-05-24 22:03:30 +08:00
josc146
bcb38d991a add role: "system" support 2023-05-24 14:01:22 +08:00
josc146
c741b2a203 fix api completion_lock (#6) 2023-05-24 11:45:55 +08:00
josc146
9a3657e6ea delete cache before updating 2023-05-23 12:37:13 +08:00
josc146
1d08719645 update requirements and /status 2023-05-23 12:13:12 +08:00
josc146
524d9e78e6 SwitchModelBody.customCuda 2023-05-23 11:51:43 +08:00
josc146
7989e93afe fixed torch version; CUDA acceleration utils 2023-05-23 11:19:39 +08:00
josc146
375af3bc1a improve compatible API 2023-05-22 11:24:57 +08:00
josc146
85493da730 add compatible /v1/completions API 2023-05-22 11:18:37 +08:00
josc146
74ceffb32c fix completion_text 2023-05-21 23:25:58 +08:00
josc146
c3084a3290 fix py lock 2023-05-21 13:46:54 +08:00
josc146
b8f7582513 chore & auto dep 2023-05-20 23:34:33 +08:00
josc146
9076ff3fd7 upload dep_check 2023-05-20 21:32:20 +08:00
josc146
82ea93ef3d update 2023-05-20 17:07:27 +08:00
josc146
5883686003 chore 2023-05-20 15:33:38 +08:00
josc146
752b72e2c9 improve chat page 2023-05-19 20:10:30 +08:00
josc146
7ba90ae7af detect status 2023-05-19 15:59:04 +08:00
josc146
1105fbf6ec chat page 2023-05-19 14:22:37 +08:00
josc146
934f7b15e8 i18n notifications and details 2023-05-18 21:19:13 +08:00
josc146
df8eef5f64 update 2023-05-17 21:20:41 +08:00
josc146
11813454de update 2023-05-17 11:47:45 +08:00
josc146
c947052574 preliminary usable features 2023-05-17 11:39:00 +08:00
josc146
83f0bb503c update 2023-05-15 21:55:57 +08:00
josc146
9763de8f64 update 2023-05-07 22:48:52 +08:00
josc146
0e852daf43 backend api 2023-05-07 17:27:54 +08:00
josc146
ac3e34e1d8 update 2023-05-06 20:17:39 +08:00