Commit 65dfd34

Author: Project Team
Raise default Ollama timeout from 60s to 120s
llama3.2-vision on a T4 can take 60-90s per inference on some images, particularly during the image-encoding phase before the first token is produced. With a 60s timeout, the first request sometimes consumed the entire budget, leaving queued requests with no time left to wait. 120s leaves enough headroom for worst-case inference while still bounding truly hung requests.
1 parent 015127b commit 65dfd34

1 file changed

Lines changed: 1 addition & 1 deletion

app/config.py

@@ -26,7 +26,7 @@ class Settings(BaseSettings):
         description="Ollama vision model for label OCR"
     )
     ollama_timeout_seconds: int = Field(
-        default=60,
+        default=120,
         description="Timeout for Ollama OCR requests in seconds"
     )
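The reasoning in the commit message can be illustrated with a rough sketch of the timing arithmetic. This is not code from the repository: the function name `requests_within_timeout` and the inference times are hypothetical, and the model is deliberately simplified (all requests submitted at once, served one at a time, each client's timeout clock starting at submission).

```python
from itertools import accumulate


def requests_within_timeout(durations_s, timeout_s):
    """Return, per request, whether it would finish before the client timeout.

    Simplified model (an assumption, not measured behaviour): all requests are
    submitted at t=0, the server handles them one at a time in order, and each
    client's timeout clock starts at submission.
    """
    # Running total of inference time = finish time of each queued request.
    finish_times = accumulate(durations_s)
    return [finish <= timeout_s for finish in finish_times]


# Hypothetical inference times in seconds: one slow image, then a fast one.
durations = [85.0, 12.0]

print(requests_within_timeout(durations, timeout_s=60))   # [False, False]
print(requests_within_timeout(durations, timeout_s=120))  # [True, True]
```

Under these assumptions, a single slow image exhausts a 60s budget for everything queued behind it, while 120s lets both requests complete and still caps how long a hung request can block.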
