- Make LLMs run on Spyre accelerators and CPUs (yes, CPUs)
- 🔩 Bend vLLM into places it wasn't designed for
- Fix build systems across architectures
- 📦 Make multi-arch containers actually behave
- ⚡ CPU-only LLM inference pipelines
- s390x (CPU) and Spyre support for modern ML stacks
- ☸️ Infra that works beyond a single machine
- 🔥 Squeeze out every drop of performance
- ⚠️ Make PyTorch do questionable things
- ☸️ Run clean infra on Kubernetes


