diff --git a/content/ngf/how-to/gateway-api-inference-extension.md b/content/ngf/how-to/gateway-api-inference-extension.md index 5cfec3330..7c4d5633d 100644 --- a/content/ngf/how-to/gateway-api-inference-extension.md +++ b/content/ngf/how-to/gateway-api-inference-extension.md @@ -70,7 +70,7 @@ NGINX will query the Endpoint Picker Extension to determine the appropriate pod {{< call-out "warning" >}} The Endpoint Picker Extension is a third-party application written and provided by the Gateway API Inference Extension project. Communication between NGINX and the Endpoint Picker uses TLS with certificate verification disabled by default, as the Endpoint Picker does not currently support mounting CA certificates. The Gateway API Inference Extension is in alpha status and should not be used in production. NGINX Gateway Fabric is not responsible for any threats or risks associated with using this third-party Endpoint Picker Extension application. {{< /call-out >}} ```shell -export IGW_CHART_VERSION=v1.0.1 +export IGW_CHART_VERSION=v1.1.0 helm install vllm-llama3-8b-instruct \ --set inferencePool.modelServers.matchLabels.app=vllm-llama3-8b-instruct \ --version $IGW_CHART_VERSION \ @@ -113,7 +113,7 @@ GW_IP=XXX.YYY.ZZZ.III GW_PORT= ``` -## Deploy a HTTPRoute +## Deploy an HTTPRoute ```yaml kubectl apply -f - <