# config.yaml
vision:
  base_url: "https://whatever.ai/v1"
  api_key: "your_key" # or set VISION_API_KEY env var
  model: "google/gemma-3-27b"
  # model: "mlx-community/qwen3.5-122b-a10b"
  max_tokens: 512
  image_max_size: 1920 # resize longest edge before sending to API

prompts:
  system: >
    You are an expert photo metadata specialist. Analyze images carefully and return
    accurate, descriptive tags and captions in both English and German.
    Always respond with valid JSON only.
  user: >
    Analyze this image and return a JSON object with exactly four fields:
      - "tags_en": an array of 5-8 concise English keyword strings covering
        subjects, colors, mood, setting, style, and any notable elements
      - "tags_de": the same tags translated into German
      - "caption_en": a single English sentence describing the image clearly and specifically
      - "caption_de": the same caption translated into German

server:
  host: "0.0.0.0"
  port: 8000
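
# Illustrative sketch of the response shape requested by prompts.user.
# The field names come from the prompt; the values are hypothetical
# placeholders, not output from any real model run.
#
# {
#   "tags_en": ["harbor", "sunset", "sailboat", "orange", "calm"],
#   "tags_de": ["Hafen", "Sonnenuntergang", "Segelboot", "orange", "ruhig"],
#   "caption_en": "A sailboat rests in a calm harbor at sunset.",
#   "caption_de": "Ein Segelboot liegt bei Sonnenuntergang in einem ruhigen Hafen."
# }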