From 3cd3bf2f6929f1583d106b773283fa40a4bd346c Mon Sep 17 00:00:00 2001
From: Saylor Berman
Date: Fri, 19 Dec 2025 12:19:20 -0700
Subject: [PATCH] docs: Update NGF inference doc

Updated the supported inference version. Also removed a port that isn't used.
---
 content/ngf/how-to/gateway-api-inference-extension.md | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

diff --git a/content/ngf/how-to/gateway-api-inference-extension.md b/content/ngf/how-to/gateway-api-inference-extension.md
index 5cfec3330..7c4d5633d 100644
--- a/content/ngf/how-to/gateway-api-inference-extension.md
+++ b/content/ngf/how-to/gateway-api-inference-extension.md
@@ -70,7 +70,7 @@ NGINX will query the Endpoint Picker Extension to determine the appropriate pod
 {{< call-out "warning" >}} The Endpoint Picker Extension is a third-party application written and provided by the Gateway API Inference Extension project. Communication between NGINX and the Endpoint Picker uses TLS with certificate verification disabled by default, as the Endpoint Picker does not currently support mounting CA certificates. The Gateway API Inference Extension is in alpha status and should not be used in production. NGINX Gateway Fabric is not responsible for any threats or risks associated with using this third-party Endpoint Picker Extension application. {{< /call-out >}}
 
 ```shell
-export IGW_CHART_VERSION=v1.0.1
+export IGW_CHART_VERSION=v1.1.0
 helm install vllm-llama3-8b-instruct \
 --set inferencePool.modelServers.matchLabels.app=vllm-llama3-8b-instruct \
 --version $IGW_CHART_VERSION \
@@ -113,7 +113,7 @@ GW_IP=XXX.YYY.ZZZ.III
 GW_PORT=
 ```
 
-## Deploy a HTTPRoute
+## Deploy an HTTPRoute
 
 ```yaml
 kubectl apply -f - <