ci: add automated test coverage for llmapi (PyTorch backend) path

## Background

PR #852 promotes the `llmapi` (PyTorch/LLM API) backend as the simpler getting-started path. Reviewer @yinggeh correctly noted that unlike `inflight_batcher_llm`, there is currently no automated CI coverage for the `llmapi` path.

The existing tests reference:
- [`L0_backend_trtllm`](https://github.com/NVIDIA/TensorRT-LLM/tree/main/triton_backend/ci) — uses `inflight_batcher_llm`
- [`L0_openai_trtllm`](https://github.com/triton-inference-server/server/tree/main/qa/L0_openai) — uses `inflight_batcher_llm`

## Goal

Add an equivalent end-to-end CI test for the `llmapi` path, covering:
- Launch via `launch_triton_server.py --model_repo=.../llmapi/`
- Basic generate request (`/v2/models/tensorrt_llm/generate`)
- Optionally: request cancellation

## Scope

This likely lives in `NVIDIA/TensorRT-LLM` under `triton_backend/ci/` (mirroring `L0_backend_trtllm`), coordinated with the server repo's `L0_openai_trtllm` if OpenAI-compatible endpoint coverage is also needed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: add automated test coverage for llmapi (PyTorch backend) path #854

Background

Goal

Scope

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

ci: add automated test coverage for llmapi (PyTorch backend) path #854

Description

Background

Goal

Scope

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions