feat: Add request cancellation to C++ gRPC client#896
Open
yinggeh wants to merge 1 commit into
Conversation
whoisj approved these changes on May 14, 2026
mudit-eng reviewed on May 14, 2026
```diff
     const std::vector<std::vector<const InferRequestedOutput*>>& outputs,
-    const Headers& headers, grpc_compression_algorithm compression_algorithm)
+    const Headers& headers, grpc_compression_algorithm compression_algorithm,
+    std::vector<CallContext*>* ctxs_out)
```
mudit-eng: Pointer to pointer is tricky and error-prone. Why not pass it as a reference?

yinggeh (author): Because the argument is optional, and a reference cannot be null.

mudit-eng: Help me understand: is it `ctxs_out` that would be null, or the vector entries inside it? Asking because we could use an empty vector instead.

yinggeh (author): It is the pointer itself that can be null; its initial pointed-to value doesn't matter. See the example in the document:
https://github.com/triton-inference-server/client/pull/896/changes#diff-b335630551682c19a781afebcf4d07bf978fb1f8ac04c6bf87428ed5106870f5R603-R616
Summary
Adds per-request cancellation to the C++ gRPC client, mirroring the existing Python `tritonclient.grpc` cancellation interfaces.

- `AsyncInfer` gains an optional trailing `CallContext** ctx_out`. On `Error::Success`, the caller receives a heap-allocated `CallContext` whose `Cancel()` method calls `grpc::ClientContext::TryCancel()`. Calling `Cancel()` after natural completion is a safe no-op.
- `AsyncInferMulti` gains an optional trailing `std::vector<CallContext*>* ctxs_out` with one handle per fanned-out request. Cancellation is per-request; the multi callback still fires exactly once after every leaf produces a result.
- `StopStream` gains a `bool cancel_requests = false`. When true, the streaming RPC is `TryCancel`'d and the stream callback receives one final `InferResult` whose status contains `Locally cancelled by application!`. `StartStream` can then be called again; `grpc_context_` is rebuilt in place because `grpc::ClientContext` is non-movable and a cancelled context cannot be reused.
- Cancelled requests report the status message `Locally cancelled by application!`, matching `tritonclient.grpc._utils.get_cancelled_error()` in the Python client.

Backwards compatibility
Every new parameter defaults to `nullptr`/`false`. Existing call sites compile and behave exactly as before; cancellation is strictly opt-in.

Testing
New `src/c++/tests/grpc_cancellation_test.cc` (gtest) with 7 cases:

- `TestGrpcAsyncInfer` (`test_grpc_async_infer` parity, 1:1)
- `TestGrpcAsyncInferCancelAfterCompletionIsNoOp`
- `TestGrpcAsyncInferWithoutContextStillCompletes`
- `TestGrpcAsyncInferMulti`
- `TestGrpcStreamInfer` (`test_grpc_stream_infer` parity, 1:1)
- `TestGrpcStreamCancelWithoutInfer`
- `TestGrpcStreamCancelThenRestart`

Related
triton-inference-server/server#8775