add pre-compiled beta cuda kernel (rwkv-beta==0.8.5, 40%+ faster for fp16) (thanks to #180, pre-compiled kernel of RTX 40 Series will be included later)

This commit is contained in:
josc146
2023-09-18 23:02:49 +08:00
parent 5e5e1e9651
commit d7abe5f0d1
3 changed files with 3 additions and 0 deletions

Binary file not shown.