support MLC-AI/mlc & RWKV ai00_server

Regarding running local LLMs, may I suggest supporting non-CUDA setups as first-class citizens.

There are two outstanding projects out there that ignore the __every GPU is an Nvidia__ credo and are therefore usable on all other hardware, which is most likely the majority (but is totally ignored).

Please have a look at:
https://github.com/mlc-ai/mlc-llm
and 
https://github.com/BlinkDL/RWKV-LM
along with the outstandingly fast and compact server, which runs a quantized 13B Vicuna on an old RX 580 with 8 GB (via Vulkan):
https://github.com/cgisky1980/ai00_rwkv_server
(Full OpenAI API support is pending.)

support MLC-AI/mlc & RWKV ai00_server #29
