GitHub - pie-project/pie: Pie: Programmable LLM Serving

Pie: Programmable serving system for emerging LLM applications

Website | Guide | Reference | Paper (SOSP'25)

A programmable serving system for custom inference logic, stateful agents, and serving-side optimization.

Note Pie is pre-release software under active development. It is best suited for testing and research right now.

What is Pie?

Today's LLM serving engines (e.g., vLLM, SGLang, TensorRT-LLM) are black boxes: prompt in, tokens out. But AI agents are a different kind of workload. They branch, call tools, retry, and coordinate long-running workflows, and forcing them through a monolithic token-generation pipeline leads to wasted round trips, KV cache thrashing, and engine patches for every new decoding trick.

Pie is a programmable serving system. It runs small user-supplied WebAssembly programs, called inferlets, directly next to the model. Inferlets have direct access to the KV cache and forward pass, so agent loops, tool calls, custom samplers, and cache policies can be customized and optimized per-application without modifying the engine.

Quick Start

Pie is a standalone binary, no Python needed.

For macOS and Linux:

curl -fsSL https://pie-project.org/install.sh | bash

For Windows, follow the installation guide.

Then configure and run:

pie config init
pie run text-completion -- --prompt "The capital of France is"

Project Layout

Directory	Description
`runtime/`	Inferlet runtime
`server/`	CLI
`inferlets/`	Example inferlets
`sdk/`	Inferlet SDKs (Rust · Python · JavaScript)
`client/`	Client libraries (Rust · Python · JavaScript)
`driver/`	Pie drivers (portable / CUDA / vLLM / SGLang)
`website/`	pie-project.org docs site

Getting Help

Questions and bug reports are welcome on GitHub Issues and GitHub Discussions.

License

Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 2,095 Commits
.github		.github
benches		benches
client		client
driver		driver
inferlets		inferlets
runtime		runtime
scripts		scripts
sdk		sdk
server		server
tests		tests
website		website
.dockerignore		.dockerignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is Pie?

Quick Start

Project Layout

Getting Help

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

What is Pie?

Quick Start

Project Layout

Getting Help

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages