rfc-0010: gateway interceptors by drew · Pull Request #1927 · NVIDIA/OpenShell

drew · 2026-06-16T06:42:57Z

Note

The RFC is open for feedback.

Summary

Adds RFC-0010 for Gateway Interceptors, a proposed gateway extension system for deployment-specific business logic around OpenShell gateway API operations.

Operators and external integrators need a flexible way to customize gateway API behavior to fit their own requirements — for example, enforcing tenancy, quotas, naming conventions, or policy authority. Today any such customization has to be hardcoded into gateway handlers or pushed into drivers, which mixes responsibilities and does not scale to deployment-specific requirements.

This RFC proposes a first-class extension system that lets external services observe, modify, validate, reject, or audit gateway operations at well-defined phases. We call these Gateway Interceptors.

Related to #1919

Checklist

Follows Conventional Commits
Commits are signed off (DCO)
Architecture docs updated (if applicable)

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

copy-pr-bot · 2026-06-16T06:43:00Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

ddurst-nvidia · 2026-06-22T05:01:07Z

Acknowledging: DRAFT NOT READY FOR COMMENTS

I do have comments, whenever you're ready for them.

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

elezar · 2026-06-23T14:56:10Z

+
+This RFC proposes a first-class extension system that lets external services
+observe, modify, validate, reject, or audit gateway operations at well-defined
+phases. We call these **Gateway Interceptors**.


The model seems similar to the middleware model that @pimlock is proposing in #1733. Is there some reasonable common name that could be used for both that signify intent (with qualifiers for which part of the system they are applied to)?

ddurst-nvidia

Thanks for updating the RFC, having a "nameable" GatewayInterceptor helps on readability, and reduces a lot of ambiguity I was seeing with "Interceptor."

ddurst-nvidia · 2026-06-23T15:59:25Z

+
+Every interceptor service has a timeout and response size limit. Gateway API
+interceptor bindings also have a maximum patch count.
+


The RFC defines endpoint configuration and per-review timeout behavior, but it does not say how review calls are bounded under load.

Since tonic clients fit Tower’s Service model, I’d like the RFC to describe the review path as a tower::Service stack and define the intended use of existing Layers for timeout, concurrency limits, buffering, load shedding, retry policy, and tracing.

That keeps the RFC focused on operational semantics while pointing implementers toward well-known, configurable, tested Tower layers rather than bespoke versions of the same behavior, which would be easy to confuse with a tonic::service::Interceptor.

If we don't want to bind the RFC to Rust or the tower::Service ecosystem (which this project already is), that's fine too, but we should still define the intended operational semantics and suggest that implementations use the appropriate service-middleware framework for their runtime.

We don't want to prescribe a service stack (rust + tonic). This RFC is intended to detail the contract and semantics for hooking into gateway calls. I think it'd be perfectly fine if someone wants to write a gRPC service using a different stack. It just needs to conform to the proto contract.

As part of the implementation we can give some examples for building an interceptor.

Does that address this comment?

ddurst-nvidia · 2026-06-23T16:18:21Z

+
+- `grpc://host:port` connects to a plaintext gRPC interceptor service over TCP.
+- `grpcs://host:port` connects to a TLS-protected gRPC interceptor service over TCP.
+- `unix:///path/to/socket` connects to a gRPC interceptor service over a Unix domain


The RFC still allows interceptor endpoints as unix:///path, but it does not define how the gateway authenticates the service behind that socket.

The interceptor service is external, the gateway only dials a configured path; it does not create, bind, or own the socket. Pathname reachability is not peer identity. A squatted socket path, writable parent directory, stale socket, or a typo can make the wrong process the policy authority, and that process can return allowed=true; fail_closed does not help when the RPC succeeds.

If plaintext UDS remains supported, the RFC should specify the required trust checks:

filesystem sockets only

no abstract sockets

an operator-only socket directory that is never mounted into sandboxes (no matter the policy)

no symlink traversal/path substitution

owner/mode/type verification before connect

peer-credential verification where the platform supports it.

It should also call out that UID/GID permissions are not a reliable sandbox boundary for rootful Docker deployments unless user namespace remapping (userns-remap) or an equivalent isolation property is required. The enable_user_namespaces flag in OpenShell is implemented in the k8s driver only.

But the simplest approach is: Require an authenticated transport for interceptor services, such as TLS/mTLS, so the gateway authenticates the interceptor by cryptographic identity rather than by pathname access.

Below on 219 I start to address some of this

Remote gRPC interceptors require authentication. The exact approach is out of scope for this RFC, but the implementation should support mTLS and
bearer-token authentication.

I don't understand this feedback either

It should also call out that UID/GID permissions are not a reliable sandbox boundary for rootful Docker deployments

This seems unrelated but perhaps I'm missing a nuance.

docs(rfc): add gateway interceptors RFC

c9deff1

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

drew added this to OpenShell Roadmap Jun 16, 2026

github-project-automation Bot moved this to Todo in OpenShell Roadmap Jun 16, 2026

drew added rfc and removed rfc labels Jun 16, 2026

drew removed this from OpenShell Roadmap Jun 16, 2026

drew added 2 commits June 16, 2026 00:05

docs(rfc): clarify gateway interceptors proposal

3c0335e

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

docs(rfc): clarify interceptor source of truth

3020acb

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

drew changed the title ~~docs(rfc): add gateway interceptors RFC~~ rfc-0010: gateway interceptors Jun 16, 2026

drew added the rfc label Jun 16, 2026

drew added this to OpenShell Roadmap Jun 16, 2026

github-project-automation Bot moved this to Todo in OpenShell Roadmap Jun 16, 2026

drew moved this from Todo to In progress in OpenShell Roadmap Jun 16, 2026

drew added 2 commits June 23, 2026 00:06

docs(rfc): refine gateway interceptor proposal

3bdf531

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

wip

0a26c9e

drew mentioned this pull request Jun 23, 2026

feat(gateway): add operation interceptors #1936

Draft

5 tasks

docs(rfc): document interceptor order example

b87edf2

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

drew marked this pull request as ready for review June 23, 2026 07:36

drew requested review from a team, derekwaynecarr, maxamillion and mrunalp as code owners June 23, 2026 07:36

elezar reviewed Jun 23, 2026

View reviewed changes

Comment thread rfc/0010-gateway-interceptors/README.md

docs(rfc): renumber gateway interceptors RFC

f8081d9

Signed-off-by: Drew Newberry <anewberry@nvidia.com>

elezar reviewed Jun 23, 2026

View reviewed changes

johntmyers mentioned this pull request Jun 23, 2026

OpenShell Extensibility #1904

Open

ddurst-nvidia reviewed Jun 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rfc-0010: gateway interceptors#1927

rfc-0010: gateway interceptors#1927
drew wants to merge 7 commits into
mainfrom
gateway-hooks

drew commented Jun 16, 2026 •

edited

Loading

Uh oh!

copy-pr-bot Bot commented Jun 16, 2026

Uh oh!

ddurst-nvidia commented Jun 22, 2026

Uh oh!

Uh oh!

elezar Jun 23, 2026

Uh oh!

ddurst-nvidia left a comment •

edited

Loading

Uh oh!

ddurst-nvidia Jun 23, 2026

Uh oh!

drew Jun 23, 2026

Uh oh!

ddurst-nvidia Jun 23, 2026

Uh oh!

drew Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		Every interceptor service has a timeout and response size limit. Gateway API
		interceptor bindings also have a maximum patch count.

Conversation

drew commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Checklist

Uh oh!

copy-pr-bot Bot commented Jun 16, 2026

Uh oh!

ddurst-nvidia commented Jun 22, 2026

Uh oh!

Uh oh!

elezar Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

ddurst-nvidia left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ddurst-nvidia Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

drew Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

ddurst-nvidia Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

drew Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

drew commented Jun 16, 2026 •

edited

Loading

ddurst-nvidia left a comment •

edited

Loading