Commit 65dfd34
Project Team
Raise default Ollama timeout from 60s to 120s
llama3.2-vision on a T4 can take 60-90s for inference on some images,
particularly during the image-encoding phase before the first token.
With a 60s timeout, the first request sometimes consumed the entire
budget, leaving queued requests with no remaining time to wait. 120s
gives enough headroom for worst-case inference while still bounding
truly hung requests.

1 parent 015127b
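The change described is a single timeout constant. As a hedged illustration only (the actual diff is not shown here, and the constant name, function, and endpoint below are hypothetical), the kind of change looks like this for a client calling Ollama's HTTP API:

```python
import json
import urllib.request

# Hypothetical sketch of the change described in the commit message:
# the per-request Ollama timeout raised from 60s to 120s, since
# llama3.2-vision on a T4 can take 60-90s before the first token.
OLLAMA_TIMEOUT_SECONDS = 120  # was 60

def generate(prompt, model="llama3.2-vision",
             base_url="http://localhost:11434"):
    """Call Ollama's /api/generate endpoint with a bounded timeout."""
    req = urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(
            {"model": model, "prompt": prompt, "stream": False}
        ).encode(),
        headers={"Content-Type": "application/json"},
    )
    # timeout= still bounds truly hung requests, just with more headroom
    with urllib.request.urlopen(req, timeout=OLLAMA_TIMEOUT_SECONDS) as resp:
        return json.loads(resp.read())["response"]
```

A 120s budget covers the observed 60-90s worst case while still failing fast on a wedged server.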
1 file changed
Lines changed: 1 addition & 1 deletion