Build orchestration for LLM pipelines. Define multi-step model workflows in YAML — download, fine-tune, evaluate, and deploy — and run them locally or on cloud infrastructure.
This repository is currently in alpha. The code and documentation are under active development and may change frequently as we work to improve usability and reliability. Contributions and feedback are welcome, but please be aware that breaking changes may occur.
- What is Granite.Build?
- Quick start
- Example
build.yaml - Repository layout
- Features
- Supported environments
- CLI
- REST API
- Documentation
- Try the demos
- Contributing
- License
Granite.Build orchestrates LLM build pipelines. You describe your workflow in a build.yaml file — which models to download, how to fine-tune them, what evaluations to run — and Granite.Build executes each step in the environment you choose: a local Docker container, a Kubernetes cluster, a cloud GPU instance, or a plain bash process on your laptop.
The system has three main components:
-
gbserver — the orchestration server. It provides a REST API (
/api/v1) for build management and a build watcher that polls for pending builds and dispatches them to execution environments. It stores build metadata in SQLite (standalone) or PostgreSQL (production). -
gb (gbcli) — the command-line client. It talks to the server's REST API to submit builds, list status, manage artifacts, and more.
-
build.yaml — the pipeline definition. Each file declares a set of named targets (logical stages like "download", "fine-tune", "evaluate"). Each target specifies an execution environment, input/output artifacts, and one or more steps to run. Targets can depend on each other through artifact bindings — when an upstream target produces an output, downstream targets that reference it are automatically dispatched.
flowchart LR
BY@{ shape: doc, label: "build.yaml" } -->|"gb build start"| API["gbserver REST API"]
API --> BW["BuildWatcher"]
BW --> BR["BuildRunner"]
BR --> Docker
BR --> Kubernetes
BR --> Bash
BR --> RunPod
BR --> SkyPilot
Docker & Kubernetes & Bash & RunPod & SkyPilot --> AS["Artifact stores</br>(HuggingFace, file://, git://)"]
classDef white fill:#fff,color:#000,stroke:#999
class BY white
The BuildWatcher polls storage for pending builds and creates a BuildRunner for each one. The build runs in an associated Space providing environments, credentials, artifact stores and step implementations.
The buildrunner walks the target graph, resolving dependencies and launching steps through the configured Environment (Docker, Kubernetes, Bash, RunPod, or SkyPilot). Each step can pull inputs from and push outputs to artifact stores selected by URI scheme (hf://, file://, git://, cos://).
Granite.build can be configured to run in the cloud or on the local machine. A cloud-based configuration can be seen here. Most of what follows utilizes the standalone configuration, shown below.
Five commands to a running build, using the bundled standalone-quickstart sample.
In a new terminal, run the following:
# 1. Clone and enter the repo
git clone git@github.com:ibm-granite/granite.build.git
cd granite.build
# 2. Create the venv and install (no Artifactory or cloud creds needed)
make standalone-venv PYTHON=python3.13
source .venv/bin/activate
# 3. Start the standalone server, pointed at the in-repo local space
gbserver standalone --space-dir configurations/spaces/local
In a second terminal, run the build using the servers started above:
# 4. Activate the venv and submit the sample build
cd granite.build
source .venv/bin/activate
export GB_ENVIRONMENT=STANDALONE
gb build start -f samples/standalone/standalone-quickstart/build.yaml
# 5. Watch progress
gb build status <build-id>
gb build log <build-id>The sample runs a single step in a local bash process — no Docker required. To switch backends, edit the environment_uri line in
samples/standalone/standalone-quickstart/build.yaml; the file has bash, docker, runpod, and skypilot options pre-commented.
Auth note (skip for localhost): when the client and server are both on the same host,
gbserverallows unauthenticated access from127.0.0.1/::1and the quickstart above just works. If you're runninggbserveron a remote box (or hitting auth errors), set a shared secret in both terminals before running steps 3 and 4:export GBSERVER_API_KEY="my-secret-key" # same value in both terminals
For a longer walkthrough of the same path, see docs/getting-started.md.
A minimal pipeline that runs a single step in a Docker container:
llm.build: # alias: granite.build (both keys are accepted)
name: my-build
targets:
download:
environment_uri: space://environments/docker
inputs:
model:
uri: hf://huggingface.co/ibm-granite/granite-3.3-2b-instruct
outputs:
model:
uri: file:workspace/model
steps:
- step_uri: space://steps/somestepA multi-target pipeline chains stages through bindings:
llm.build:
name: tune-and-eval
targets:
download:
environment_uri: space://environments/docker
outputs:
model: { uri: file:workspace/model }
steps:
- step_uri: space://steps/somestep
fine-tune:
environment_uri: space://environments/docker
inputs:
model: { binding: download.model }
outputs:
checkpoint: { uri: file:workspace/checkpoint }
steps:
- step_uri: space://steps/sft
evaluate:
environment_uri: space://environments/docker
inputs:
model: { binding: fine-tune.checkpoint }
steps:
- step_uri: space://steps/evalFor the full schema, see docs/users/build-yaml-reference.md.
| Path | Description |
|---|---|
src/gbserver/ |
Build orchestration server (REST API, build engine, storage). |
src/gbcli/ |
CLI client (gb) for interacting with gbserver. |
src/gbcommon/ |
Shared types and utilities. |
docs/ |
User, operator, and contributor docs — start at docs/README.md. |
samples/ |
Sample build configs, environments, and steps. The standalone-quickstart is the canonical first build. |
examples/ |
Worked examples for specific scenarios. |
configurations/ |
Space, environment, step, and assetstore configurations consumed by builds. configurations/assets/ holds the reusable assetstores, environments, and steps; configurations/spaces/local/ is the user-facing space for GB_ENVIRONMENT=STANDALONE and ships the build templates. |
test/ |
Test suites for all components. |
scripts/ |
Helper scripts including the standalone and SLURM demos. |
k8s/ |
Helm charts for production Kubernetes deployment. |
Makefile |
make standalone-venv, make demo-venv, make image, format/lint targets. |
- Multi-environment execution — Docker, Kubernetes, RunPod, SkyPilot/AWS, or local bash
- HuggingFace Hub integration — download and push models and datasets via
hf://URIs - Pipeline orchestration — chain steps with artifact bindings in a single
build.yaml - CLI client —
gbcommand for build management, artifact handling, model operations, and more - REST API — FastAPI-based build management at
/api/v1 - Standalone mode — SQLite + thread-based execution, no external services needed
- Lineage tracking — records data provenance of builds, targets, and artifacts
| Environment | Platform | GPU Support | Status |
|---|---|---|---|
| Docker | Linux, macOS | Yes (nvidia-container-toolkit) | Stable |
| Bash | macOS / Linux | CPU only | Stable |
| Kubernetes | Linux | Yes | Stable |
| SLURM (via SkyPilot) | Linux | Yes (auto-detected) | Beta |
| RunPod | Cloud | Yes | Beta |
| SkyPilot / AWS | Cloud | Yes | Beta |
The CLI client is available as multiple equivalent entry points: gb, gbcli, llmbuild, llmb, lamb. The test harness ships as gbtest, and the server ships as gbserver.
gb # client (build, artifact, space, secret, model, ...)
gbserver # server (standalone, rest-server, build-watch, build, ...)
gbtest # YAML-driven build assertions for tests
Run gb --help or gbserver --help for top-level usage. Common flows:
gb build start -f build.yaml # submit a build
gb build list # list recent builds
gb build status <build-id> # show build state and per-step status
gb build log <build-id> # stream logs
gb build cancel <build-id> # cancel a running build
gb artifact list # list artifactsFor the full subcommand reference, see docs/users/cli-reference.md.
The REST API is available at /api/v1 when the server is running. Start with gbserver standalone or gbserver rest-server and visit http://localhost:8080/docs for the interactive OpenAPI documentation. Authentication options (GitHub, IBMid, API key) are documented in docs/operators/multi-provider-authentication.md.
The docs/ directory has complete reference material. Three reading paths from the docs index:
- Writing a build →
build.yamlreference, CLI reference, HuggingFace push, features (retry, target reuse, lineage, gbtest). - Running gbserver →
environment.yamlreference, setup scripts, SkyPilot Kubernetes setup, troubleshooting. - Changing gbserver → architecture diagram, environment classes.
End-to-end demos with TRL fine-tuning and unitxt evaluation. Each runs locally and tears down cleanly. Full setup instructions in docs/demos.md.
# Standalone Docker — fine-tune + eval in containers on this machine
make demo-venv PYTHON=python3.13 && source .venv/bin/activate
bash scripts/demo-standalone.sh
# SLURM via SkyPilot — same workload on a local Docker SLURM cluster, with MinIO push
make g4os-skypilot-venv PYTHON=python3.13 && source .venv/bin/activate
make minio-setup && make slurm-setup
bash scripts/demo-slurm.shSee CONTRIBUTING.md for development setup, code style, and pull request guidelines. This project follows the Contributor Covenant v2.1. To report a vulnerability, see SECURITY.md.


