Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
eaeccd9
chore: adjust installation.md
paulbuehlerdev Jan 15, 2026
943e02d
chore: adjust features.md
paulbuehlerdev Jan 15, 2026
c9c2f13
chore: remove redundant env configuration.
paulbuehlerdev Jan 16, 2026
92fdd9b
chore: adjust kibana dashboard example
paulbuehlerdev Jan 16, 2026
5229088
chore: improve job general page
paulbuehlerdev Jan 19, 2026
4793a43
chore: improve capture page
paulbuehlerdev Jan 19, 2026
4cdad31
chore: adjust composer page
paulbuehlerdev Jan 19, 2026
28029fc
chore: adjust job delivery page
paulbuehlerdev Jan 19, 2026
96b6e85
chore: adjust data retention pages
paulbuehlerdev Jan 19, 2026
e2d6882
chore: change delivery interface page
paulbuehlerdev Jan 20, 2026
a460d56
chore: adjust delivery interface pages
paulbuehlerdev Jan 20, 2026
7fcba74
chore: improve spaces and authentication
paulbuehlerdev Jan 20, 2026
ac5458a
chore: improve self monitoring
paulbuehlerdev Jan 20, 2026
7d1378c
chore: improve backup page
paulbuehlerdev Jan 20, 2026
b5a47a0
chore: adjust ai providers page
paulbuehlerdev Jan 20, 2026
90c7b08
chore: adjust local auth page
paulbuehlerdev Jan 21, 2026
c6d3407
chore: improve ldap page
paulbuehlerdev Jan 21, 2026
3b33184
chore: improve example pages
paulbuehlerdev Jan 21, 2026
6ffba3d
chore: improve grafana page
paulbuehlerdev Jan 21, 2026
530dd7d
chore: improve conditional report
paulbuehlerdev Jan 21, 2026
e858398
chore: improve advanced examples page
paulbuehlerdev Jan 21, 2026
6b71bcc
chore: improve mixed sources report
paulbuehlerdev Jan 21, 2026
9dddfa9
chore: improve ai news collation
paulbuehlerdev Jan 21, 2026
11f9420
chore: kibana anomaly
paulbuehlerdev Jan 21, 2026
35f4236
chore: improve kibana anomaly detection page
paulbuehlerdev Jan 22, 2026
67949c6
revert: changes on version stable
paulbuehlerdev Jan 22, 2026
7060013
feat: extend changelog
paulbuehlerdev Jan 22, 2026
1148d0e
chore: upgrade stable version
paulbuehlerdev Jan 22, 2026
544ea1d
chore: review findings
paulbuehlerdev Jan 22, 2026
2650a9e
chore: improve slack webhook
paulbuehlerdev Jan 22, 2026
abb87a6
chore: update stable
paulbuehlerdev Jan 22, 2026
6677453
revert: links
paulbuehlerdev Jan 22, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
284 changes: 31 additions & 253 deletions docs/administration/ai-providers.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,13 @@
---
sidebar_position: 3
description: Configure AI providers in Anaphora for intelligent report analysis - GPT-5.2, Claude 4.5, Grok 4, DeepSeek V3, Qwen3, Llama 4, and self-hosted options.
keywords: [AI providers, GPT-5, Claude 4.5, Grok 4, DeepSeek V3, Qwen3, Llama 4, vLLM, LLM integration, AI analysis]
keywords: [ AI providers, GPT-5, Claude 4.5, Grok 4, DeepSeek V3, Qwen3, Llama 4, vLLM, LLM integration, AI analysis ]
---

# AI Providers

Configure AI providers to enable intelligent analysis, content generation, and anomaly detection in your reports. Anaphora supports any provider implementing the OpenAI-compatible API specification.
Configure AI providers to enable intelligent analysis, content generation, and anomaly detection in your reports.
Anaphora supports any provider implementing the OpenAI-compatible API specification.

## Overview

Expand All @@ -32,196 +33,25 @@ flowchart LR
api -- "Analysis result" --> job
```

## OpenAI-Compatible API

Anaphora works with any provider implementing the OpenAI API specification. This includes:

| Provider | Description | Self-Hosted |
|----------|-------------|-------------|
| **OpenAI** | GPT-5.2, GPT-5.1 models | No |
| **Anthropic** | Claude Opus 4.5, Sonnet 4.5, Haiku 4.5 | No |
| **xAI** | Grok 4, Grok 3 models | No |
| **Azure OpenAI** | Microsoft-hosted OpenAI models | No |
| **DeepSeek** | DeepSeek-V3.2, cost-effective reasoning | No |
| **vLLM** | Self-hosted open-source models | Yes |
| **Ollama** | Local model runner | Yes |
| **LM Studio** | Desktop model runner | Yes |

## Configuration

### Adding a Provider

1. Navigate to **Settings** > **AI Providers**
1. Navigate to **AI Providers**
2. Click **Add Provider**
3. Configure the connection:

| Field | Description | Example |
|-------|-------------|---------|
| **Name** | Identifier for this provider | `Production GPT-5` |
| **API Endpoint** | OpenAI-compatible base URL | `https://api.openai.com/v1` |
| **API Key** | Authentication token | `sk-...` (stored encrypted) |
| **Default Model** | Model to use if not specified | `gpt-5.2` |
| Field | Description | Example |
|--------------|-------------------------------------------------------|--------------------------------|
| **Name** | Identifier for this provider | `Production GPT-5` |
| **Provider** | Type of AI provider | `OpenAI`, `DeepSeek`, `Custom` |
| **Endpoint** | OpenAI-compatible base URL (Only for custom provider) | `https://api.openai.com/v1` |
| **API Key** | Authentication token | `sk-...` (stored encrypted) |
| **Model** | Model to use. Needs API key to fetch suggestions. | `gpt-5.2` |

4. Test the connection
5. Save

### Multiple Providers

Configure multiple providers for different use cases:

```
Provider: DeepSeek-V3
├── Endpoint: https://api.deepseek.com/v1
├── Model: deepseek-v3.2
└── Use for: Complex analysis, reasoning tasks

Provider: vLLM-Qwen
├── Endpoint: http://vllm.internal:8000/v1
├── Model: Qwen3-32B-Instruct
└── Use for: High-volume processing, cost control

Provider: OpenAI-GPT5
├── Endpoint: https://api.openai.com/v1
├── Model: gpt-5.2
└── Use for: Critical reports, best quality

Provider: Anthropic-Claude
├── Endpoint: https://api.anthropic.com/v1
├── Model: claude-sonnet-4.5
└── Use for: Coding analysis, agentic tasks
```

### Multiple Models per Provider

Each provider can expose multiple models:

| Provider | Available Models |
|----------|-----------------|
| OpenAI | `gpt-5.2`, `gpt-5.2-codex`, `gpt-5.1` |
| Anthropic | `claude-opus-4.5`, `claude-sonnet-4.5`, `claude-haiku-4.5` |
| xAI | `grok-4`, `grok-4-heavy`, `grok-3`, `grok-3-mini` |
| DeepSeek | `deepseek-v3.2`, `deepseek-r1` |
| vLLM | Depends on deployed models (Qwen3, Llama 4, etc.) |

## Provider Types

### DeepSeek

Cost-effective models with strong reasoning capabilities. DeepSeek-V3.2 integrates thinking directly into tool-use.

**Configuration:**
```
Name: DeepSeek
Endpoint: https://api.deepseek.com/v1
API Key: sk-xxxxx
Default Model: deepseek-v3.2
```

**Available Models:**
- `deepseek-v3.2` — Latest flagship, thinking + tool-use integration
- `deepseek-v3.1` — Hybrid model combining V3 and R1 strengths
- `deepseek-r1` — Advanced reasoning, complex analysis

### vLLM (Self-Hosted)

Run open-source models on your infrastructure for data privacy and cost control.

**Configuration:**
```
Name: vLLM Internal
Endpoint: http://vllm-server.internal:8000/v1
API Key: (optional, depends on deployment)
Default Model: Qwen3-32B-Instruct
```

**Popular Models for vLLM:**
- `Qwen3-32B-Instruct` — Strong general purpose, 128K context
- `Qwen3-235B-A22B` — Massive MoE model (22B active params)
- `Llama-4-Maverick` — Meta's multimodal MoE (17B active, 400B total)
- `Llama-4-Scout` — 10M context window (17B active, 109B total)
- `Qwen3-Coder-480B` — State-of-the-art coding (35B active)

**Benefits:**
- Complete data privacy — data never leaves your infrastructure
- No per-token costs after initial setup
- Customizable model selection
- Low latency for internal networks

### OpenAI

Direct integration with OpenAI's API. GPT-5.2 is the current flagship with Instant, Thinking, and Pro variants.

**Configuration:**
```
Name: OpenAI
Endpoint: https://api.openai.com/v1
API Key: sk-proj-xxxxx
Default Model: gpt-5.2
```

**Available Models:**
- `gpt-5.2` — Latest flagship, best for coding and agentic tasks
- `gpt-5.2-codex` — Optimized for agentic coding workflows
- `gpt-5.1` — Previous generation (Instant and Thinking variants)

### Anthropic Claude

State-of-the-art models for coding, agents, and complex reasoning. Claude 4.5 series offers excellent cost-performance.

**Configuration:**
```
Name: Anthropic
Endpoint: https://api.anthropic.com/v1
API Key: sk-ant-xxxxx
Default Model: claude-sonnet-4.5
```

**Available Models:**
- `claude-opus-4.5` — Flagship, long-horizon autonomous tasks ($5/$25 per M tokens)
- `claude-sonnet-4.5` — Best balance of quality and cost ($3/$15 per M tokens)
- `claude-haiku-4.5` — Fast and efficient ($1/$5 per M tokens)

### xAI Grok

Strong reasoning models with real-time knowledge. Grok 4 is the latest flagship with 1M token context.

**Configuration:**
```
Name: xAI Grok
Endpoint: https://api.x.ai/v1
API Key: xai-xxxxx
Default Model: grok-4
```

**Available Models:**
- `grok-4` — Latest flagship, strong reasoning and tool use
- `grok-4-heavy` — Maximum capability variant
- `grok-3` — Previous generation, good cost-performance balance
- `grok-3-mini` — Fast responses, lower cost

### Azure OpenAI

Enterprise deployment through Microsoft Azure.

**Configuration:**
```
Name: Azure OpenAI
Endpoint: https://your-resource.openai.azure.com
API Key: xxxxx
Default Model: gpt-5 (deployment name)
```

## Space-Level Configuration

AI Providers are configured per Space for isolation:

| Space | AI Provider | Use Case |
|-------|-------------|----------|
| Production | OpenAI GPT-5.2 | Critical reports, highest quality |
| Development | vLLM Qwen3 | Testing, iteration |
| Client-Acme | DeepSeek V3.2 | Cost-effective analysis |
| Client-Beta | Claude Sonnet 4.5 | Balanced quality and cost |

### Provider Inheritance

- Each Space can have its own AI provider configuration
Expand All @@ -230,25 +60,29 @@ AI Providers are configured per Space for isolation:

## Using AI in Jobs

### Composer Integration
### Job Capture Integration

Add AI-generated content blocks in the Composer:
Use AI actions in the job actions:

1. Add an **AI Analysis** block
2. Select the analysis type:
- **Summary** — Condense captured data
- **Trends** — Identify patterns over time
- **Anomalies** — Highlight unusual values
- **Custom** — Provide your own prompt
1. Edit or create a job
2. Navigate to the **Capture** tab
3. Enable advanced mode to use actions.
4. Add **AI** action
5. Configure:

3. Configure the prompt (for custom analysis)
4. Preview the output
| Field | Description |
|---------------|-----------------------------------------|
| Variable Name | Name of the variable to store AI output |
| Provider | Select the AI provider to use |
| Prompt | Define the prompt with instructions |
| Output type | Text, HTML, Number |
| Context | Choose what to include as context |

### Example: Dashboard Summary

```
Block: AI Analysis
Type: Summary
Context: Dashboard snapshot showing system performance metrics
Type: Text
Prompt: "Summarize the key metrics from this dashboard,
highlighting any values that exceed normal ranges."

Expand All @@ -261,70 +95,14 @@ Output: "System performance remains stable with 99.8% uptime.
### Example: Anomaly Detection

```
Block: AI Analysis
Type: Anomaly Detection
Context: Error rate visualization captured from Kibana
Type: Number
Prompt: "Identify any anomalies in the error rate data. Respond with 0 if none found. Return 1 if anomalies are detected."

Output: "⚠️ Anomaly detected: Error rate spiked to 4.2% between
09:15-09:45 UTC, significantly above the 0.5% baseline.
This correlates with the deployment at 09:12 UTC visible
in the deployment timeline panel."
Output: 0
```

## Best Practices

### Provider Selection

| Use Case | Recommended Provider |
|----------|---------------------|
| **Security-sensitive data** | Self-hosted vLLM (Qwen3, Llama 4) |
| **Cost-sensitive high volume** | DeepSeek V3.2 or vLLM |
| **Best quality analysis** | OpenAI GPT-5.2 or Claude Opus 4.5 |
| **Coding and agentic tasks** | Claude Sonnet 4.5 or GPT-5.2-Codex |
| **Compliance requirements** | Azure OpenAI or self-hosted |

### Prompt Engineering

For consistent results:

- Be specific about the desired output format
- Provide context about what the data represents
- Specify the audience (technical vs. executive)
- Include examples of good output when possible

### Cost Management

- Use appropriate models for each task (don't use Opus 4.5 for simple summaries — use Haiku 4.5)
- Set token limits for responses
- Monitor usage through provider dashboards
- Consider self-hosted options for high-volume use cases (Qwen3, Llama 4)

### Reliability

- Configure fallback providers for critical jobs
- Test AI features in development Spaces first
- Set reasonable timeout values
- Handle AI failures gracefully (jobs should complete even if AI fails)

## Troubleshooting

### Connection Issues

| Issue | Solution |
|-------|----------|
| Connection timeout | Check endpoint URL, verify network access |
| 401 Unauthorized | Verify API key is correct and active |
| 429 Rate Limited | Reduce request frequency, upgrade plan |
| Model not found | Check model name matches provider's offerings |

### Quality Issues

| Issue | Solution |
|-------|----------|
| Poor analysis quality | Improve prompts, try different model |
| Inconsistent outputs | Add more specific instructions |
| Missing context | Ensure captured data is included |
| Hallucinations | Use more constrained prompts |
Use the output to add a condition in your job workflow for alerting.

## Next Steps

Expand Down
Loading