Commit Graph

76 Commits

Author SHA1 Message Date
josc146
d32351c130 exact model name 2023-06-19 22:30:49 +08:00
josc146
967be6f88f refactor completions api 2023-06-18 20:16:52 +08:00
josc146
21c3009945 improve api docs 2023-06-15 21:52:22 +08:00
josc146
714b8834c7 chore 2023-06-13 22:47:17 +08:00
josc146
8431b5d24f log Generation Prompt 2023-06-12 13:41:51 +08:00
josc146
cea1d8b4d1 add logs for state cache and switch-model 2023-06-09 20:46:19 +08:00
josc146
4e75531651 fix the crash issue caused by temperature being 0 2023-06-04 11:53:33 +08:00
josc146
38b775c937 add logs 2023-06-03 17:12:59 +08:00
josc146
2f5a7d2d51 fix_tokens 2023-05-31 16:07:09 +08:00
josc146
cf16e54463 fix_tokens 2023-05-31 14:55:13 +08:00
josc146
c8b2bb53ef improve system for rwkv-4-world 2023-05-31 12:46:06 +08:00
josc146
8291c50058 safe ModelConfigBody 2023-05-30 23:13:27 +08:00
josc146
da033ab096 chore 2023-05-29 20:51:20 +08:00
josc146
142e30622e send response even token is END_OF_TEXT 2023-05-29 20:17:29 +08:00
josc146
fecdf238c1 feat: preload preset_system 2023-05-29 00:08:13 +08:00
josc146
3e11128c9d feat: use model state cache to achieve 5x - 50x faster preparation time for generation 2023-05-28 23:52:38 +08:00
josc146
94971bb666 support for rwkv-4-world 2023-05-28 12:53:14 +08:00
josc146
06622b79aa update rwkv_generate 2023-05-25 20:34:42 +08:00
josc146
524d9e78e6 SwitchModelBody.customCuda 2023-05-23 11:51:43 +08:00
josc146
7989e93afe fixed torch version; CUDA acceleration utils 2023-05-23 11:19:39 +08:00
josc146
5883686003 chore 2023-05-20 15:33:38 +08:00
josc146
752b72e2c9 improve chat page 2023-05-19 20:10:30 +08:00
josc146
df8eef5f64 update 2023-05-17 21:20:41 +08:00
josc146
c947052574 preliminary usable features 2023-05-17 11:39:00 +08:00
josc146
83f0bb503c update 2023-05-15 21:55:57 +08:00
josc146
0e852daf43 backend api 2023-05-07 17:27:54 +08:00