Commit Graph

182 Commits

Author SHA1 Message Date
josc146
7235e1067b add deployment mode. If /switch-model with deploy: true, will disable /switch-model, /exit and other dangerous APIs (state cache APIs, part of midi APIs) 2023-11-08 23:29:42 +08:00
josc146
d249a4c29a print error.txt 2023-11-08 22:57:38 +08:00
josc146
cfa3669f6f fix /docs default api params (Pydantic v2) 2023-11-07 22:53:11 +08:00
josc146
db6fbe8366 add python webui server 2023-11-07 22:22:29 +08:00
josc146
64826b9af7 fix log encoding error 2023-11-05 21:00:31 +08:00
josc146
47b0c35441 update ngrok_connect 2023-11-04 20:22:28 +08:00
josc146
1dcda47013 improve startup process 2023-11-04 20:21:55 +08:00
josc146
1f81a1e5a8 upgrade to rwkv 0.8.20 2023-11-03 23:27:14 +08:00
josc146
14b90bb36b improve dml mode performance (20% faster, https://github.com/BlinkDL/ChatRWKV/pull/181) 2023-10-30 20:24:57 +08:00
josc146
f86b7f1f08 python38 compatibility 2023-10-29 14:11:11 +08:00
josc146
ff7306349a improve memory usage of state cache 2023-10-28 23:04:49 +08:00
josc146
c87de93498 allow conversation with some document (.pdf, .txt) 2023-10-27 11:36:29 +08:00
josc146
faf1852012 update stop strategy 2023-10-26 17:47:40 +08:00
josc146
43cfab5d4b change default World series prefix to User/Assistant 2023-10-26 16:58:53 +08:00
josc146
627a20936d RWKVType now no longer relies on the file name 2023-10-26 16:55:33 +08:00
josc146
d7ba88953d chore 2023-10-25 22:53:14 +08:00
josc146
30e1c3171e update kernel (CUDA Compute Capability 5.3) 2023-10-25 22:53:14 +08:00
josc146
1f058b16ac update kernel (CUDA Compute Capability 6.1, Previously 7.5) 2023-10-25 22:53:13 +08:00
josc146
4a192f4057 upgrade to webgpu 0.2.2 (https://github.com/josStorer/ai00_rwkv_server) 2023-10-25 21:02:44 +08:00
josc146
0331bf47f7 upgrade rwkv 0.8.16 (DirectML support; rwkv 5.2 no longer needs to ensure custom cuda kernel enabled) 2023-10-25 17:56:18 +08:00
josc146
2acdaa96b2 chore 2023-10-25 17:51:59 +08:00
josc146
1d200d53ab fix beta linux kernel 2023-10-25 17:51:13 +08:00
josc146
df9e1f408e add /file-to-text api 2023-10-25 17:14:33 +08:00
josc146
46b3b285f5 upgrade packages 2023-10-25 17:07:40 +08:00
josc146
0005816c1d fix linux kernel (partial revert 68228a45) 2023-10-05 00:08:18 +08:00
josc146
68228a4552 rwkv5 pre-compiled kernel (for windows) 2023-10-03 13:39:07 +08:00
josc146
79851433f8 upgrade rwkv pip (0.8.13) 2023-10-03 13:33:55 +08:00
josc146
d7abe5f0d1 add pre-compiled beta cuda kernel (rwkv-beta==0.8.5, 40%+ faster for fp16) (thanks to #180, pre-compiled kernel of RTX 40 Series will be included later) 2023-09-18 23:02:49 +08:00
josc146
5e5e1e9651 custom tokenizer .txt support 2023-09-18 17:20:55 +08:00
josc146
a25965530c custom tokenizer (#77) 2023-09-16 00:34:11 +08:00
josc146
d7dcc90008 chore 2023-09-15 16:31:14 +08:00
josc146
df969fcfc6 upgrade cuda-beta 2023-09-15 16:30:11 +08:00
josc146
50ff7ef6bc always use requirements.txt 2023-08-27 23:52:52 +08:00
josc146
a24b78e8c3 python-backend: extra ChatCompletionBody params (raw, presystem);
add default_stop when stop is null
2023-08-27 21:21:11 +08:00
josc146
c8025f1cff allow message content to be empty 2023-08-27 21:02:54 +08:00
josc146
02d5d641d1 chore 2023-08-24 22:48:54 +08:00
josc146
ef53951a16 webgpu support 2023-08-16 23:07:58 +08:00
josc146
61cea2a784 add misc API (/models and /dashboard/billing/credit_grants) 2023-08-14 23:37:55 +08:00
josc146
8a13bd3c1e add rwkv-cuda-beta support (faster) 2023-08-14 22:07:15 +08:00
josc146
da68926e9c chore (AddStateBody class) 2023-08-13 21:27:29 +08:00
josc146
e0b7453883 allow multiple systems 2023-08-04 22:27:55 +08:00
josc146
91e2828a95 allow completions input to be null 2023-08-04 22:22:59 +08:00
josc146
b3e35a4cdd allow custom user_name and assistant_name (/chat/completions API) 2023-07-31 22:48:54 +08:00
josc146
8764c37b03 RWKVType 2023-07-31 22:46:13 +08:00
josc146
d12a173f39 global penalty 2023-07-31 22:02:28 +08:00
josc146
aecacde819 remove response field of completions api 2023-07-29 19:20:43 +08:00
josc146
3ef22239eb improve default ChatCompletion stop 2023-07-29 19:19:38 +08:00
josc146
719090cc8c improve python backend startup speed 2023-07-29 19:18:01 +08:00
josc146
9d89b6f4db fix params 2023-07-28 22:13:19 +08:00
josc146
d0fd480bd6 chore 2023-07-26 22:24:26 +08:00
josc146
1df345b5eb improve embeddings API results 2023-07-25 20:30:43 +08:00
josc146
77868c798b chore 2023-07-25 16:37:06 +08:00
josc146
f56748a941 improve python backend startup speed 2023-07-25 16:14:29 +08:00
josc146
29c5b1d804 add midi api 2023-07-25 16:11:17 +08:00
josc146
34095a6c36 support for stop array 2023-07-25 16:10:22 +08:00
josc146
05b9b42b56 add support for MIDI RWKV 2023-07-25 16:09:31 +08:00
josc146
9b3b06ab04 fix input with array type (#96, #107) 2023-07-17 12:59:45 +08:00
josc146
994fc7c828 fix cross-device state cache exception 2023-07-11 11:20:12 +08:00
josc146
f9f1d5c9fc improve /completions api compatibility 2023-07-10 20:45:08 +08:00
josc146
6fbb86667c improve python script error messages 2023-07-07 20:16:35 +08:00
josc146
987854fe49 lora finetune (need to be refactored) 2023-07-03 17:41:47 +08:00
josc146
417389c5f6 improve for python3.8 3.9 2023-06-29 20:12:11 +08:00
josc146
9ed3547738 rwkv pip 0.8.0 2023-06-28 19:36:15 +08:00
josc146
131a7ddf4a fix the prompt cache that contains potential error 2023-06-21 16:07:16 +08:00
josc146
43bc08648d update manifest 2023-06-20 16:07:52 +08:00
josc146
e93c77394d add usage 2023-06-20 15:55:52 +08:00
josc146
8963543159 embeddings api compatible with openai api and langchain(sdk) 2023-06-19 22:51:06 +08:00
josc146
377f71b16b type 2023-06-19 22:32:02 +08:00
josc146
d32351c130 exact model name 2023-06-19 22:30:49 +08:00
josc146
967be6f88f refactor completions api 2023-06-18 20:16:52 +08:00
josc146
721653a812 fix the state cache crash caused by bad prompts 2023-06-15 22:37:00 +08:00
josc146
21c3009945 improve api docs 2023-06-15 21:52:22 +08:00
josc146
51c5696bb9 improve python dependencies installation 2023-06-14 22:21:17 +08:00
josc146
714b8834c7 chore 2023-06-13 22:47:17 +08:00
josc146
5896593951 max_trie_len 2023-06-12 15:22:17 +08:00
josc146
8431b5d24f log Generation Prompt 2023-06-12 13:41:51 +08:00
josc146
bbd1ac1484 allow unloading model with switch-model 2023-06-12 12:34:03 +08:00
josc146
5990567a79 avoid misoperations of state_cache 2023-06-12 12:32:50 +08:00
josc146
fa0fcc2c89 add support for python3.8 3.9 2023-06-12 12:09:23 +08:00
josc146
cea1d8b4d1 add logs for state cache and switch-model 2023-06-09 20:46:19 +08:00
josc146
635767408f fix UnboundLocalError: local variable 'response' referenced before assignment 2023-06-08 13:30:34 +08:00
josc146
9bd9b9ecbd add requirements_without_cyac.txt 2023-06-05 22:58:56 +08:00
josc146
4e75531651 fix the crash issue caused by temperature being 0 2023-06-04 11:53:33 +08:00
josc146
edc6ac7297 chore 2023-06-03 20:34:33 +08:00
josc146
966b912013 improve logs 2023-06-03 19:28:37 +08:00
josc146
dc71054e61 improve logs 2023-06-03 17:36:50 +08:00
josc146
38b775c937 add logs 2023-06-03 17:12:59 +08:00
josc146
b41a2e7039 move state cache to memory (todo: state cache db) 2023-06-02 21:33:57 +08:00
josc146
b63370928d macOS 2023-06-01 16:54:21 +08:00
josc146
2f5a7d2d51 fix_tokens 2023-05-31 16:07:09 +08:00
josc146
cf16e54463 fix_tokens 2023-05-31 14:55:13 +08:00
josc146
c8b2bb53ef improve system for rwkv-4-world 2023-05-31 12:46:06 +08:00
josc146
8291c50058 safe ModelConfigBody 2023-05-30 23:13:27 +08:00
josc146
9945338458 chore 2023-05-30 11:52:33 +08:00
josc146
53b6a5ffe0 allow system to be placed anywhere 2023-05-29 22:26:22 +08:00
josc146
da033ab096 chore 2023-05-29 20:51:20 +08:00
josc146
142e30622e send response even token is END_OF_TEXT 2023-05-29 20:17:29 +08:00
josc146
55bb33bcbb embed all core dependencies 2023-05-29 20:14:42 +08:00
josc146
6fc5a335fb embed dependencies 2023-05-29 09:39:16 +08:00
josc146
fecdf238c1 feat: preload preset_system 2023-05-29 00:08:13 +08:00