josc146
|
faf1852012
|
update stop strategy
|
2023-10-26 17:47:40 +08:00 |
|
josc146
|
43cfab5d4b
|
change default World series prefix to User/Assistant
|
2023-10-26 16:58:53 +08:00 |
|
josc146
|
627a20936d
|
RWKVType now no longer relies on the file name
|
2023-10-26 16:55:33 +08:00 |
|
josc146
|
d7ba88953d
|
chore
|
2023-10-25 22:53:14 +08:00 |
|
josc146
|
30e1c3171e
|
update kernel (CUDA Compute Capability 5.3)
|
2023-10-25 22:53:14 +08:00 |
|
josc146
|
1f058b16ac
|
update kernel (CUDA Compute Capability 6.1, Previously 7.5)
|
2023-10-25 22:53:13 +08:00 |
|
josc146
|
4a192f4057
|
upgrade to webgpu 0.2.2 (https://github.com/josStorer/ai00_rwkv_server)
|
2023-10-25 21:02:44 +08:00 |
|
josc146
|
0331bf47f7
|
upgrade rwkv 0.8.16 (DirectML support; rwkv 5.2 no longer needs to ensure custom cuda kernel enabled)
|
2023-10-25 17:56:18 +08:00 |
|
josc146
|
2acdaa96b2
|
chore
|
2023-10-25 17:51:59 +08:00 |
|
josc146
|
1d200d53ab
|
fix beta linux kernel
|
2023-10-25 17:51:13 +08:00 |
|
josc146
|
df9e1f408e
|
add /file-to-text api
|
2023-10-25 17:14:33 +08:00 |
|
josc146
|
46b3b285f5
|
upgrade packages
|
2023-10-25 17:07:40 +08:00 |
|
josc146
|
0005816c1d
|
fix linux kernel (partial revert 68228a45)
|
2023-10-05 00:08:18 +08:00 |
|
josc146
|
68228a4552
|
rwkv5 pre-compiled kernel (for windows)
|
2023-10-03 13:39:07 +08:00 |
|
josc146
|
79851433f8
|
upgrade rwkv pip (0.8.13)
|
2023-10-03 13:33:55 +08:00 |
|
josc146
|
d7abe5f0d1
|
add pre-compiled beta cuda kernel (rwkv-beta==0.8.5, 40%+ faster for fp16) (thanks to #180, pre-compiled kernel of RTX 40 Series will be included later)
|
2023-09-18 23:02:49 +08:00 |
|
josc146
|
5e5e1e9651
|
custom tokenizer .txt support
|
2023-09-18 17:20:55 +08:00 |
|
josc146
|
a25965530c
|
custom tokenizer (#77)
|
2023-09-16 00:34:11 +08:00 |
|
josc146
|
d7dcc90008
|
chore
|
2023-09-15 16:31:14 +08:00 |
|
josc146
|
df969fcfc6
|
upgrade cuda-beta
|
2023-09-15 16:30:11 +08:00 |
|
josc146
|
50ff7ef6bc
|
always use requirements.txt
|
2023-08-27 23:52:52 +08:00 |
|
josc146
|
a24b78e8c3
|
python-backend: extra ChatCompletionBody params (raw, presystem); add default_stop when stop is null
|
2023-08-27 21:21:11 +08:00 |
|
josc146
|
c8025f1cff
|
allow message content to be empty
|
2023-08-27 21:02:54 +08:00 |
|
josc146
|
02d5d641d1
|
chore
|
2023-08-24 22:48:54 +08:00 |
|
josc146
|
ef53951a16
|
webgpu support
|
2023-08-16 23:07:58 +08:00 |
|
josc146
|
61cea2a784
|
add misc API (/models and /dashboard/billing/credit_grants)
|
2023-08-14 23:37:55 +08:00 |
|
josc146
|
8a13bd3c1e
|
add rwkv-cuda-beta support (faster)
|
2023-08-14 22:07:15 +08:00 |
|
josc146
|
da68926e9c
|
chore (AddStateBody class)
|
2023-08-13 21:27:29 +08:00 |
|
josc146
|
e0b7453883
|
allow multiple systems
|
2023-08-04 22:27:55 +08:00 |
|
josc146
|
91e2828a95
|
allow completions input to be null
|
2023-08-04 22:22:59 +08:00 |
|
josc146
|
b3e35a4cdd
|
allow custom user_name and assistant_name (/chat/completions API)
|
2023-07-31 22:48:54 +08:00 |
|
josc146
|
8764c37b03
|
RWKVType
|
2023-07-31 22:46:13 +08:00 |
|
josc146
|
d12a173f39
|
global penalty
|
2023-07-31 22:02:28 +08:00 |
|
josc146
|
aecacde819
|
remove response field of completions api
|
2023-07-29 19:20:43 +08:00 |
|
josc146
|
3ef22239eb
|
improve default ChatCompletion stop
|
2023-07-29 19:19:38 +08:00 |
|
josc146
|
719090cc8c
|
improve python backend startup speed
|
2023-07-29 19:18:01 +08:00 |
|
josc146
|
9d89b6f4db
|
fix params
|
2023-07-28 22:13:19 +08:00 |
|
josc146
|
d0fd480bd6
|
chore
|
2023-07-26 22:24:26 +08:00 |
|
josc146
|
1df345b5eb
|
improve embeddings API results
|
2023-07-25 20:30:43 +08:00 |
|
josc146
|
77868c798b
|
chore
|
2023-07-25 16:37:06 +08:00 |
|
josc146
|
f56748a941
|
improve python backend startup speed
|
2023-07-25 16:14:29 +08:00 |
|
josc146
|
29c5b1d804
|
add midi api
|
2023-07-25 16:11:17 +08:00 |
|
josc146
|
34095a6c36
|
support for stop array
|
2023-07-25 16:10:22 +08:00 |
|
josc146
|
05b9b42b56
|
add support for MIDI RWKV
|
2023-07-25 16:09:31 +08:00 |
|
josc146
|
9b3b06ab04
|
fix input with array type (#96, #107)
|
2023-07-17 12:59:45 +08:00 |
|
josc146
|
994fc7c828
|
fix cross-device state cache exception
|
2023-07-11 11:20:12 +08:00 |
|
josc146
|
f9f1d5c9fc
|
improve /completions api compatibility
|
2023-07-10 20:45:08 +08:00 |
|
josc146
|
6fbb86667c
|
improve python script error messages
|
2023-07-07 20:16:35 +08:00 |
|
josc146
|
987854fe49
|
lora finetune (need to be refactored)
|
2023-07-03 17:41:47 +08:00 |
|
josc146
|
417389c5f6
|
improve for python3.8 3.9
|
2023-06-29 20:12:11 +08:00 |
|
josc146
|
9ed3547738
|
rwkv pip 0.8.0
|
2023-06-28 19:36:15 +08:00 |
|
josc146
|
131a7ddf4a
|
fix the prompt cache that contains potential error
|
2023-06-21 16:07:16 +08:00 |
|
josc146
|
43bc08648d
|
update manifest
|
2023-06-20 16:07:52 +08:00 |
|
josc146
|
e93c77394d
|
add usage
|
2023-06-20 15:55:52 +08:00 |
|
josc146
|
8963543159
|
embeddings api compatible with openai api and langchain(sdk)
|
2023-06-19 22:51:06 +08:00 |
|
josc146
|
377f71b16b
|
type
|
2023-06-19 22:32:02 +08:00 |
|
josc146
|
d32351c130
|
exact model name
|
2023-06-19 22:30:49 +08:00 |
|
josc146
|
967be6f88f
|
refactor completions api
|
2023-06-18 20:16:52 +08:00 |
|
josc146
|
721653a812
|
fix the state cache crash caused by bad prompts
|
2023-06-15 22:37:00 +08:00 |
|
josc146
|
21c3009945
|
improve api docs
|
2023-06-15 21:52:22 +08:00 |
|
josc146
|
51c5696bb9
|
improve python dependencies installation
|
2023-06-14 22:21:17 +08:00 |
|
josc146
|
714b8834c7
|
chore
|
2023-06-13 22:47:17 +08:00 |
|
josc146
|
5896593951
|
max_trie_len
|
2023-06-12 15:22:17 +08:00 |
|
josc146
|
8431b5d24f
|
log Generation Prompt
|
2023-06-12 13:41:51 +08:00 |
|
josc146
|
bbd1ac1484
|
allow unloading model with switch-model
|
2023-06-12 12:34:03 +08:00 |
|
josc146
|
5990567a79
|
avoid misoperations of state_cache
|
2023-06-12 12:32:50 +08:00 |
|
josc146
|
fa0fcc2c89
|
add support for python3.8 3.9
|
2023-06-12 12:09:23 +08:00 |
|
josc146
|
cea1d8b4d1
|
add logs for state cache and switch-model
|
2023-06-09 20:46:19 +08:00 |
|
josc146
|
635767408f
|
fix UnboundLocalError: local variable 'response' referenced before assignment
|
2023-06-08 13:30:34 +08:00 |
|
josc146
|
9bd9b9ecbd
|
add requirements_without_cyac.txt
|
2023-06-05 22:58:56 +08:00 |
|
josc146
|
4e75531651
|
fix the crash issue caused by temperature being 0
|
2023-06-04 11:53:33 +08:00 |
|
josc146
|
edc6ac7297
|
chore
|
2023-06-03 20:34:33 +08:00 |
|
josc146
|
966b912013
|
improve logs
|
2023-06-03 19:28:37 +08:00 |
|
josc146
|
dc71054e61
|
improve logs
|
2023-06-03 17:36:50 +08:00 |
|
josc146
|
38b775c937
|
add logs
|
2023-06-03 17:12:59 +08:00 |
|
josc146
|
b41a2e7039
|
move state cache to memory (todo: state cache db)
|
2023-06-02 21:33:57 +08:00 |
|
josc146
|
b63370928d
|
macOS
|
2023-06-01 16:54:21 +08:00 |
|
josc146
|
2f5a7d2d51
|
fix_tokens
|
2023-05-31 16:07:09 +08:00 |
|
josc146
|
cf16e54463
|
fix_tokens
|
2023-05-31 14:55:13 +08:00 |
|
josc146
|
c8b2bb53ef
|
improve system for rwkv-4-world
|
2023-05-31 12:46:06 +08:00 |
|
josc146
|
8291c50058
|
safe ModelConfigBody
|
2023-05-30 23:13:27 +08:00 |
|
josc146
|
9945338458
|
chore
|
2023-05-30 11:52:33 +08:00 |
|
josc146
|
53b6a5ffe0
|
allow system to be placed anywhere
|
2023-05-29 22:26:22 +08:00 |
|
josc146
|
da033ab096
|
chore
|
2023-05-29 20:51:20 +08:00 |
|
josc146
|
142e30622e
|
send response even if token is END_OF_TEXT
|
2023-05-29 20:17:29 +08:00 |
|
josc146
|
55bb33bcbb
|
embed all core dependencies
|
2023-05-29 20:14:42 +08:00 |
|
josc146
|
6fc5a335fb
|
embed dependencies
|
2023-05-29 09:39:16 +08:00 |
|
josc146
|
fecdf238c1
|
feat: preload preset_system
|
2023-05-29 00:08:13 +08:00 |
|
josc146
|
3e11128c9d
|
feat: use model state cache to achieve 5x - 50x faster preparation time for generation
|
2023-05-28 23:52:38 +08:00 |
|
josc146
|
94971bb666
|
support for rwkv-4-world
|
2023-05-28 12:53:14 +08:00 |
|
josc146
|
b7fb8ed898
|
improve api concurrency performance
|
2023-05-27 15:18:12 +08:00 |
|
josc146
|
06622b79aa
|
update rwkv_generate
|
2023-05-25 20:34:42 +08:00 |
|
josc146
|
bb8af451f6
|
fix cuda40 kernel
|
2023-05-25 00:22:09 +08:00 |
|
josc146
|
77ce87d209
|
update cuda40 kernel
|
2023-05-24 22:18:14 +08:00 |
|
josc146
|
f439b3d382
|
add api host setting
|
2023-05-24 22:03:30 +08:00 |
|
josc146
|
bcb38d991a
|
add role: "system" support
|
2023-05-24 14:01:22 +08:00 |
|
josc146
|
c741b2a203
|
fix api completion_lock (#6)
|
2023-05-24 11:45:55 +08:00 |
|
josc146
|
9a3657e6ea
|
delete cache before updating
|
2023-05-23 12:37:13 +08:00 |
|
josc146
|
1d08719645
|
update requirements and /status
|
2023-05-23 12:13:12 +08:00 |
|
josc146
|
524d9e78e6
|
SwitchModelBody.customCuda
|
2023-05-23 11:51:43 +08:00 |
|