Add huggingface/pytorch/tei/docker/1.9.3#192
Conversation
|
The final stage is named Affects both |
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
TEI 1.9.3 SM Test Failure — Root Cause AnalysisThe Issue 1: SageMaker passes
|
Description of changes:
This PR adds the Text Embeddings Inference (TEI) v1.9.3 container to be released on both AWS SageMaker and AWS EC2, as per the latest release of the upstream in https://github.com/huggingface/text-embeddings-inference/releases/v1.9.3.
This PR is pretty similar to earlier releases as #141 or #153, with some subtle differences:
entrypoint.shno longer lives on https://github.com/huggingface/text-embeddings-inference, but rather in this repository insteadtext-embeddings-routercommand in theentrypoint.shno longer requires the--port 8080flag, as the environment variablePORThas been updated to be set to 8080 instead of 80.entrypoint.shfor both CPU and NVIDIA GPUs runs thetext-embeddings-routerbinary withexecso that the signals are captured by the process, as otherwise the running PID oftext-embeddings-routerwould be different to 1, hence wouldn't capture the signalsDockerfilefiles drop the gRPC layer as it wasn't used but it was being compiled / built nonethelessBy submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
Note
By submitting this PR, I disclose that all the code in this PR was written entirely by me, @alvarobartt, without the use of any coding assistants or third-party agentic tools.