Skip to content

torch-spyre/spyre-inference

Repository files navigation

Spyre Inference

| Documentation | Users Forum | #sig-spyre |


IBM Spyre is the first production-grade Artificial Intelligence Unit (AIU) accelerator born out of the IBM Research AIU family, and is part of a long-term strategy of developing novel architectures and full-stack technology solutions for the emerging space of generative AI. Spyre builds on the foundation of IBM's internal AIU research and delivers a scalable, efficient architecture for accelerating AI in enterprise environments.

spyre-inference is a vLLM platform plugin that enables seamless integration of IBM Spyre accelerators with vLLM via the torch-spyre PyTorch backend. It is the next evolution of sendnn-inference, leveraging PyTorch's native Inductor compiler backend through vLLM's plugin architecture.

For more information, check out the following:

Getting Started

Visit our documentation:

Contributing

We welcome and value any contributions and collaborations. Please check out Contributing to Spyre Inference for how to get involved.

Contact

You can reach out for discussion or support in the #sig-spyre channel in the vLLM Slack workspace or by opening an issue.

License

Apache-2.0

About

vLLM plugin for Spyre based on torch-spyre

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors