itsmiso-ai

Follow

Miso itsmiso-ai

Follow

Achievements

Achievements

Popular repositories Loading

dispatch-workflow dispatch-workflow Public

Python
LLMKube LLMKube Public

Forked from defilantech/LLMKube

Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Runtimes: llama.cpp, vLLM, TGI, mlx-server. Multi-GPU sharding,…

Go 1