test: Add C++ gRPC cancellation tests to L0_request_cancellation by yinggeh · Pull Request #8775 · triton-inference-server/server

yinggeh · 2026-05-13T07:43:26Z

What does the PR do?

Wires the new C++ grpc_cancellation_test gtest (added in the client-side companion PR) into qa/L0_request_cancellation/test.sh as a sibling of the existing Python cancellation suite. Each gtest case is run against a fresh tritonserver and the count of Cancellation notification received for log lines is asserted to match the expected count for that case.

Also temporarily bumps the model's instance_group count to 3 around the TestGrpcAsyncInferMulti case (reverted after) so the three fanned-out requests can execute concurrently; the test cancels two of them while letting the middle one complete naturally and would otherwise serialize on a single CPU instance.

Checklist

Commit Type:

test

Related PRs:

triton-inference-server/client#896

Where should the reviewer start?

Test plan:

L0_request_cancellation--base

CI Pipeline ID: 51138710

Caveats:

Background

Triton has had Python-side request cancellation tests since r23.10 but no C++ counterpart, despite the C++ gRPC client gaining matching APIs in the companion client PR. This PR adds the missing test wiring so the two clients' cancellation behavior stays in lock-step on every CI run.

whoisj · 2026-05-14T15:19:51Z


 SERVER=/opt/tritonserver/bin/tritonserver
 source ../common/util.sh
+CANCEL_LOG_LINE="Cancellation notification received for"


probably needs a trailing space.

This sentence looks odd. There should be something after 'for'.
Do you have an example the actual log line looks like?

Two locations

server/src/grpc/infer_handler.cc

Lines 718 to 722 in 0c063cf

LOG_VERBOSE(1) << "Cancellation notification received for " << Name()

<< ", rpc_ok=" << rpc_ok << ", context "

<< state->context_->unique_id_ << " step "

<< state->context_->step_ << ", state "

<< state->unique_id_ << " step " << state->step_;

server/src/grpc/stream_infer_handler.cc

Lines 152 to 156 in 0c063cf

LOG_VERBOSE(1) << "Cancellation notification received for " << Name()

<< ", rpc_ok=" << rpc_ok << ", context "

<< state->context_->unique_id_ << " step "

<< state->context_->step_ << ", state "

<< state->unique_id_ << " step " << state->step_;

As J said, there is a trailing space after 'for'.

mudit-eng · 2026-05-15T19:11:33Z

-requests whose results are no longer required can significantly impact server
-resources.
+Triton supports handling request cancellation received from the gRPC Python
+client or a C API user (since r23.10), and C++ client (since r26.05).


This change is not going in r26.05.

Try to cherry-pick since internal team is waiting for this feature.

kind of a big overhaul this late into the release, no?

what is the risk assessment? (discuss offline)

mudit-eng · 2026-05-15T19:13:03Z


 SERVER=/opt/tritonserver/bin/tritonserver
 source ../common/util.sh
+CANCEL_LOG_LINE="Cancellation notification received for"


This sentence looks odd. There should be something after 'for'.
Do you have an example the actual log line looks like?

mudit-eng · 2026-05-15T19:15:40Z

+    TEST_LOG="./grpc_cancellation_test_cpp.$TEST_CASE.log"
+    SERVER_LOG="./grpc_cancellation_test_cpp.$TEST_CASE.server.log"
+
+    # AsyncInferMulti fans out N concurrent requests; bump to 3 CPU


is there a check for N >= 3?

Can you elaborate? Check N requests, instances or cancellation?

For N concurrent requests to fan out to 3 CPU, shouldn't we have N > 3?

In this test, each request execution takes 10 seconds. To avoid backlog in the request queue (reduce overall test time), the model configuration is increased to 3 instances. If N > 3, requests after 3rd will wait in the queue until the first 3 requests have completed execution, which will take 10 seconds.

I see what you mean. Here we are testing requests that are cancelled during execution. I can also add a test for in-queue request cancellation.

Thanks. Yes, let's test for in-queue cancellation also.

yinggeh mentioned this pull request May 13, 2026

feat: Add request cancellation to C++ gRPC client triton-inference-server/client#896

Open

yinggeh force-pushed the yinggeh/tri-967-riva-speech-skills-cpp-clients-do-not-support-request branch from 0a43ae8 to 308cad3 Compare May 13, 2026 07:54

Initial commit

0c063cf

yinggeh force-pushed the yinggeh/tri-967-riva-speech-skills-cpp-clients-do-not-support-request branch from 308cad3 to 0c063cf Compare May 14, 2026 04:58

yinggeh requested review from mudit-eng and whoisj May 14, 2026 04:59

yinggeh self-assigned this May 14, 2026

yinggeh added the PR: test Adding missing tests or correcting existing test label May 14, 2026

whoisj approved these changes May 14, 2026

View reviewed changes

mudit-eng reviewed May 15, 2026

View reviewed changes

	LOG_VERBOSE(1) << "Cancellation notification received for " << Name()
	<< ", rpc_ok=" << rpc_ok << ", context "
	<< state->context_->unique_id_ << " step "
	<< state->context_->step_ << ", state "
	<< state->unique_id_ << " step " << state->step_;

Conversation

yinggeh commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does the PR do?

Checklist

Commit Type:

Related PRs:

Where should the reviewer start?

Test plan:

Caveats:

Background

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

yinggeh commented May 13, 2026 •

edited

Loading