Skip to content

ci: add automated test coverage for llmapi (PyTorch backend) path #854

@faradawn

Description

@faradawn

Background

PR #852 promotes the llmapi (PyTorch/LLM API) backend as the simpler getting-started path. Reviewer @yinggeh correctly noted that unlike inflight_batcher_llm, there is currently no automated CI coverage for the llmapi path.

The existing tests reference:

Goal

Add an equivalent end-to-end CI test for the llmapi path, covering:

  • Launch via launch_triton_server.py --model_repo=.../llmapi/
  • Basic generate request (/v2/models/tensorrt_llm/generate)
  • Optionally: request cancellation

Scope

This likely lives in NVIDIA/TensorRT-LLM under triton_backend/ci/ (mirroring L0_backend_trtllm), coordinated with the server repo's L0_openai_trtllm if OpenAI-compatible endpoint coverage is also needed.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions