josc146 | a116eff7df | webgpu max_buffer_size | 2023-12-25 18:08:13 +08:00
josc146 | 512c4d0f73 | improve role-playing effect | 2023-12-22 10:51:09 +08:00
josc146 | 8a3905c09a | reduce precompiled web_rwkv_py size | 2023-12-15 16:26:01 +08:00
josc146 | f7494b0cfb | update midi_filter_config.json | 2023-12-14 21:18:48 +08:00
josc146 | 18d4b2304e | WebGPU (Python) strategy | 2023-12-14 20:39:42 +08:00
josc146 | 46e9a2f5b2 | add precompiled web_rwkv_py | 2023-12-14 18:42:00 +08:00
josc146 | 0ddd2e9fea | add WebGPU Python Mode (https://github.com/cryscan/web-rwkv-py) | 2023-12-14 18:37:07 +08:00
josc146 | e0bf44d82f | bump MIDI-LLM-tokenizer (fix note off) | 2023-12-14 13:33:27 +08:00
josc146 | 82c9825da8 | rwkv.cpp python38 compatibility | 2023-12-12 23:19:18 +08:00
josc146 | 26b30f0dbe | add load failed traceback | 2023-12-12 23:16:48 +08:00
josc146 | b14fbc29b7 | rwkv.cpp(ggml) support | 2023-12-12 20:29:55 +08:00
josc146 | 9b7b651ef9 | feat: import midi file | 2023-12-10 22:38:31 +08:00
josc146 | d9e25ad69f | better state cache | 2023-12-08 15:28:33 +08:00
josc146 | c853c5b60b | chore | 2023-12-06 23:09:39 +08:00
josc146 | 053a08f5b7 | update convert_safetensors.py | 2023-12-06 23:08:40 +08:00
josc146 | 861e245062 | RWKV_RESCALE_LAYER 999 for music model | 2023-12-04 17:51:21 +08:00
josc146 | 0063c171f3 | upgrade to rwkv 0.8.22 (rwkv6 support) | 2023-11-24 17:55:16 +08:00
josc146 | dbf0dccc9d | add tokenizer(/switch-model) to /docs | 2023-11-20 20:11:45 +08:00
josc146 | c8470e77fd | fix state_cache of deploy mode | 2023-11-17 21:32:11 +08:00
josc146 | 9ede7d7c6d | strict default_stop | 2023-11-17 21:18:52 +08:00
josc146 | 7235e1067b | add deployment mode. If /switch-model with deploy: true, will disable /switch-model, /exit and other dangerous APIs (state cache APIs, part of midi APIs) | 2023-11-08 23:29:42 +08:00
josc146 | d249a4c29a | print error.txt | 2023-11-08 22:57:38 +08:00
josc146 | cfa3669f6f | fix /docs default api params (Pydantic v2) | 2023-11-07 22:53:11 +08:00
josc146 | db6fbe8366 | add python webui server | 2023-11-07 22:22:29 +08:00
josc146 | 64826b9af7 | fix log encoding error | 2023-11-05 21:00:31 +08:00
josc146 | 47b0c35441 | update ngrok_connect | 2023-11-04 20:22:28 +08:00
josc146 | 1dcda47013 | improve startup process | 2023-11-04 20:21:55 +08:00
josc146 | 1f81a1e5a8 | upgrade to rwkv 0.8.20 | 2023-11-03 23:27:14 +08:00
josc146 | 14b90bb36b | improve dml mode performance (20% faster, https://github.com/BlinkDL/ChatRWKV/pull/181) | 2023-10-30 20:24:57 +08:00
josc146 | f86b7f1f08 | python38 compatibility | 2023-10-29 14:11:11 +08:00
josc146 | ff7306349a | improve memory usage of state cache | 2023-10-28 23:04:49 +08:00
josc146 | c87de93498 | allow conversation with some document (.pdf, .txt) | 2023-10-27 11:36:29 +08:00
josc146 | faf1852012 | update stop strategy | 2023-10-26 17:47:40 +08:00
josc146 | 43cfab5d4b | change default World series prefix to User/Assistant | 2023-10-26 16:58:53 +08:00
josc146 | 627a20936d | RWKVType now no longer relies on the file name | 2023-10-26 16:55:33 +08:00
josc146 | d7ba88953d | chore | 2023-10-25 22:53:14 +08:00
josc146 | 30e1c3171e | update kernel (CUDA Compute Capability 5.3) | 2023-10-25 22:53:14 +08:00
josc146 | 1f058b16ac | update kernel (CUDA Compute Capability 6.1, Previously 7.5) | 2023-10-25 22:53:13 +08:00
josc146 | 4a192f4057 | upgrade to webgpu 0.2.2 (https://github.com/josStorer/ai00_rwkv_server) | 2023-10-25 21:02:44 +08:00
josc146 | 0331bf47f7 | upgrade rwkv 0.8.16 (DirectML support; rwkv 5.2 no longer needs to ensure custom cuda kernel enabled) | 2023-10-25 17:56:18 +08:00
josc146 | 2acdaa96b2 | chore | 2023-10-25 17:51:59 +08:00
josc146 | 1d200d53ab | fix beta linux kernel | 2023-10-25 17:51:13 +08:00
josc146 | df9e1f408e | add /file-to-text api | 2023-10-25 17:14:33 +08:00
josc146 | 46b3b285f5 | upgrade packages | 2023-10-25 17:07:40 +08:00
josc146 | 0005816c1d | fix linux kernel (partial revert 68228a45) | 2023-10-05 00:08:18 +08:00
josc146 | 68228a4552 | rwkv5 pre-compiled kernel (for windows) | 2023-10-03 13:39:07 +08:00
josc146 | 79851433f8 | upgrade rwkv pip (0.8.13) | 2023-10-03 13:33:55 +08:00
josc146 | d7abe5f0d1 | add pre-compiled beta cuda kernel (rwkv-beta==0.8.5, 40%+ faster for fp16) (thanks to #180, pre-compiled kernel of RTX 40 Series will be included later) | 2023-09-18 23:02:49 +08:00
josc146 | 5e5e1e9651 | custom tokenizer .txt support | 2023-09-18 17:20:55 +08:00
josc146 | a25965530c | custom tokenizer (#77) | 2023-09-16 00:34:11 +08:00
josc146 | d7dcc90008 | chore | 2023-09-15 16:31:14 +08:00
josc146 | df969fcfc6 | upgrade cuda-beta | 2023-09-15 16:30:11 +08:00
josc146 | 50ff7ef6bc | always use requirements.txt | 2023-08-27 23:52:52 +08:00
josc146 | a24b78e8c3 | python-backend: extra ChatCompletionBody params (raw, presystem); add default_stop when stop is null | 2023-08-27 21:21:11 +08:00
josc146 | c8025f1cff | allow message content to be empty | 2023-08-27 21:02:54 +08:00
josc146 | 02d5d641d1 | chore | 2023-08-24 22:48:54 +08:00
josc146 | ef53951a16 | webgpu support | 2023-08-16 23:07:58 +08:00
josc146 | 61cea2a784 | add misc API (/models and /dashboard/billing/credit_grants) | 2023-08-14 23:37:55 +08:00
josc146 | 8a13bd3c1e | add rwkv-cuda-beta support (faster) | 2023-08-14 22:07:15 +08:00
josc146 | da68926e9c | chore (AddStateBody class) | 2023-08-13 21:27:29 +08:00
josc146 | e0b7453883 | allow multiple systems | 2023-08-04 22:27:55 +08:00
josc146 | 91e2828a95 | allow completions input to be null | 2023-08-04 22:22:59 +08:00
josc146 | b3e35a4cdd | allow custom user_name and assistant_name (/chat/completions API) | 2023-07-31 22:48:54 +08:00
josc146 | 8764c37b03 | RWKVType | 2023-07-31 22:46:13 +08:00
josc146 | d12a173f39 | global penalty | 2023-07-31 22:02:28 +08:00
josc146 | aecacde819 | remove response field of completions api | 2023-07-29 19:20:43 +08:00
josc146 | 3ef22239eb | improve default ChatCompletion stop | 2023-07-29 19:19:38 +08:00
josc146 | 719090cc8c | improve python backend startup speed | 2023-07-29 19:18:01 +08:00
josc146 | 9d89b6f4db | fix params | 2023-07-28 22:13:19 +08:00
josc146 | d0fd480bd6 | chore | 2023-07-26 22:24:26 +08:00
josc146 | 1df345b5eb | improve embeddings API results | 2023-07-25 20:30:43 +08:00
josc146 | 77868c798b | chore | 2023-07-25 16:37:06 +08:00
josc146 | f56748a941 | improve python backend startup speed | 2023-07-25 16:14:29 +08:00
josc146 | 29c5b1d804 | add midi api | 2023-07-25 16:11:17 +08:00
josc146 | 34095a6c36 | support for stop array | 2023-07-25 16:10:22 +08:00
josc146 | 05b9b42b56 | add support for MIDI RWKV | 2023-07-25 16:09:31 +08:00
josc146 | 9b3b06ab04 | fix input with array type (#96, #107) | 2023-07-17 12:59:45 +08:00
josc146 | 994fc7c828 | fix cross-device state cache exception | 2023-07-11 11:20:12 +08:00
josc146 | f9f1d5c9fc | improve /completions api compatibility | 2023-07-10 20:45:08 +08:00
josc146 | 6fbb86667c | improve python script error messages | 2023-07-07 20:16:35 +08:00
josc146 | 987854fe49 | lora finetune (need to be refactored) | 2023-07-03 17:41:47 +08:00
josc146 | 417389c5f6 | improve for python3.8 3.9 | 2023-06-29 20:12:11 +08:00
josc146 | 9ed3547738 | rwkv pip 0.8.0 | 2023-06-28 19:36:15 +08:00
josc146 | 131a7ddf4a | fix the prompt cache that contains potential error | 2023-06-21 16:07:16 +08:00
josc146 | 43bc08648d | update manifest | 2023-06-20 16:07:52 +08:00
josc146 | e93c77394d | add usage | 2023-06-20 15:55:52 +08:00
josc146 | 8963543159 | embeddings api compatible with openai api and langchain(sdk) | 2023-06-19 22:51:06 +08:00
josc146 | 377f71b16b | type | 2023-06-19 22:32:02 +08:00
josc146 | d32351c130 | exact model name | 2023-06-19 22:30:49 +08:00
josc146 | 967be6f88f | refactor completions api | 2023-06-18 20:16:52 +08:00
josc146 | 721653a812 | fix the state cache crash caused by bad prompts | 2023-06-15 22:37:00 +08:00
josc146 | 21c3009945 | improve api docs | 2023-06-15 21:52:22 +08:00
josc146 | 51c5696bb9 | improve python dependencies installation | 2023-06-14 22:21:17 +08:00
josc146 | 714b8834c7 | chore | 2023-06-13 22:47:17 +08:00
josc146 | 5896593951 | max_trie_len | 2023-06-12 15:22:17 +08:00
josc146 | 8431b5d24f | log Generation Prompt | 2023-06-12 13:41:51 +08:00
josc146 | bbd1ac1484 | allow unloading model with switch-model | 2023-06-12 12:34:03 +08:00
josc146 | 5990567a79 | avoid misoperations of state_cache | 2023-06-12 12:32:50 +08:00
josc146 | fa0fcc2c89 | add support for python3.8 3.9 | 2023-06-12 12:09:23 +08:00
josc146 | cea1d8b4d1 | add logs for state cache and switch-model | 2023-06-09 20:46:19 +08:00