

Tharushka Dinujaya edited this page Jan 31, 2026 · 1 revision

Model Configuration Guide

VaultAI lets you tune AI responses through advanced model configuration parameters. These settings control the creativity, randomness, and length of generated content.

Configuration Parameters

Temperature (0.0 - 1.0)

Controls the randomness of AI responses.

  • 0.0: Deterministic; always chooses the most likely token
  • 0.3-0.5: Balanced, good for factual content
  • 0.7-0.9: Creative, good for writing and brainstorming
  • 1.0: Maximum creativity and randomness

Default: 0.7
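
To build intuition for what temperature does, here is a minimal sketch of temperature-scaled softmax, which is how sampling temperature reshapes a model's token distribution. This is not VaultAI code, and the logit values are made up for illustration:

```typescript
// Softmax with temperature: lower T sharpens the distribution toward the
// most likely token; higher T flattens it toward uniform.
function softmaxWithTemperature(logits: number[], temperature: number): number[] {
  const t = Math.max(temperature, 1e-6); // guard against division by zero at T = 0
  const scaled = logits.map((l) => l / t);
  const max = Math.max(...scaled);       // subtract max for numerical stability
  const exps = scaled.map((s) => Math.exp(s - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// Hypothetical logits for three candidate tokens.
const logits = [2.0, 1.0, 0.1];
console.log(softmaxWithTemperature(logits, 0.2)); // near-deterministic: top token dominates
console.log(softmaxWithTemperature(logits, 1.0)); // balanced: probability spread out
```

At 0.2 the top token takes almost all of the probability mass; at 1.0 the other tokens keep a meaningful share, which is why higher temperatures feel more creative.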

Top-K (1 - 40)

Limits the AI to considering only the top K most probable tokens at each step.

  • 1-10: Very focused responses
  • 20-30: Balanced variety (recommended)
  • 40: Maximum vocabulary diversity

Default: 40
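
The filtering step can be sketched as follows: keep the K most probable tokens and renormalize so their probabilities sum to 1. This is an illustrative sketch, not VaultAI's implementation, and the distribution is hypothetical:

```typescript
// Keep only the K most probable tokens, zero out the rest, and renormalize.
// (Ties at the threshold may keep extra tokens; fine for a sketch.)
function topKFilter(probs: number[], k: number): number[] {
  const threshold = [...probs].sort((a, b) => b - a)[Math.min(k, probs.length) - 1];
  const kept = probs.map((p) => (p >= threshold ? p : 0));
  const sum = kept.reduce((a, b) => a + b, 0);
  return kept.map((p) => p / sum);
}

// Hypothetical 4-token distribution, K = 2.
console.log(topKFilter([0.5, 0.3, 0.15, 0.05], 2)); // → [0.625, 0.375, 0, 0]
```

With K = 2, only the two strongest candidates survive, which is why low K values produce very focused responses.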

Top-P (0.0 - 1.0)

Nucleus sampling: considers only the smallest set of tokens whose combined probability reaches P.

  • 0.1-0.3: Very focused
  • 0.7-0.9: Balanced (recommended)
  • 1.0: Considers all tokens

Default: 0.9
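
Nucleus sampling can be sketched in the same style as top-K: walk the tokens from most to least probable, stop once their cumulative probability reaches P, and renormalize. Again, this is an illustrative sketch with a made-up distribution, not VaultAI's code:

```typescript
// Keep the smallest set of tokens whose cumulative probability reaches p,
// then renormalize. Assumes `probs` already sums to 1.
function topPFilter(probs: number[], p: number): number[] {
  const indexed = probs
    .map((prob, i) => ({ prob, i }))
    .sort((a, b) => b.prob - a.prob);
  const kept = new Array<number>(probs.length).fill(0);
  let cumulative = 0;
  for (const { prob, i } of indexed) {
    kept[i] = prob;
    cumulative += prob;
    if (cumulative >= p) break; // nucleus reached
  }
  const sum = kept.reduce((a, b) => a + b, 0);
  return kept.map((q) => q / sum);
}

// Hypothetical 4-token distribution, P = 0.7: the top two tokens (0.5 + 0.3)
// are the smallest set reaching the threshold.
console.log(topPFilter([0.5, 0.3, 0.15, 0.05], 0.7)); // → [0.625, 0.375, 0, 0]
```

Unlike top-K, the number of surviving tokens adapts to the shape of the distribution: a confident model keeps few tokens, an uncertain one keeps many.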

Max Output Tokens (1 - 8192)

Maximum length of generated responses.

  • 512: Short responses
  • 1024: Medium responses
  • 2048: Long responses (recommended)
  • 4096-8192: Very long responses

Default: 2048
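
Taken together, the defaults above map onto a Gemini-style generation config. The field names below follow the Google Generative AI SDK; VaultAI's internal settings names may differ:

```typescript
// Gemini-style generation config mirroring the defaults listed above.
const generationConfig = {
  temperature: 0.7,      // randomness (0.0–1.0)
  topK: 40,              // number of candidate tokens considered per step
  topP: 0.9,             // nucleus sampling threshold
  maxOutputTokens: 2048, // response length cap
};
```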

How to Configure

In Obsidian Plugin Settings

  1. Open Settings → Community Plugins → VaultAI
  2. Navigate to Model Configuration section
  3. Adjust the sliders or enter values directly
  4. Settings apply immediately to new conversations

Recommended Settings by Use Case

πŸ“ Note-Taking & Summaries

  • Temperature: 0.3
  • Top-K: 20
  • Top-P: 0.8
  • Max Tokens: 1024

✍️ Creative Writing

  • Temperature: 0.9
  • Top-K: 40
  • Top-P: 0.95
  • Max Tokens: 2048

🤔 Analysis & Research

  • Temperature: 0.5
  • Top-K: 30
  • Top-P: 0.9
  • Max Tokens: 2048

💡 Brainstorming

  • Temperature: 0.8
  • Top-K: 35
  • Top-P: 0.9
  • Max Tokens: 1024
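
The four presets above could be stored as a simple lookup table. This is a sketch; the preset keys and the shape of the object are illustrative, not VaultAI's actual settings schema:

```typescript
// Illustrative preset table for the recommended settings above.
interface ModelPreset {
  temperature: number;
  topK: number;
  topP: number;
  maxTokens: number;
}

const presets: Record<string, ModelPreset> = {
  notes:         { temperature: 0.3, topK: 20, topP: 0.8,  maxTokens: 1024 },
  creative:      { temperature: 0.9, topK: 40, topP: 0.95, maxTokens: 2048 },
  research:      { temperature: 0.5, topK: 30, topP: 0.9,  maxTokens: 2048 },
  brainstorming: { temperature: 0.8, topK: 35, topP: 0.9,  maxTokens: 1024 },
};
```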

Tips & Best Practices

  1. Start Conservative: Begin with lower temperature (0.5-0.7) and adjust based on results
  2. Experiment: Different tasks benefit from different settings
  3. Balance Parameters: High temperature with low top-K can produce inconsistent results
  4. Token Limits: Higher token limits allow for more detailed responses but use more API quota
  5. Save Presets: Create custom presets for different types of work

Troubleshooting

Responses Too Random/Nonsensical

  • Lower temperature (try 0.3-0.5)
  • Reduce top-K (try 20)
  • Lower top-P (try 0.8)

Responses Too Repetitive/Boring

  • Increase temperature (try 0.7-0.9)
  • Increase top-K (try 30-40)
  • Increase top-P (try 0.95)

Responses Cut Off

  • Increase max output tokens
  • Break complex requests into smaller parts

API Quota Usage Too High

  • Reduce max output tokens
  • Use more focused prompts
  • Consider using lower temperature for factual queries

Advanced Configuration

For power users, VaultAI also supports:

  • Model Selection: Choose between different Gemini models
  • Context Window: Adjust how much conversation history to include
  • Streaming: Enable real-time response streaming
  • Custom System Prompts: Set behavior guidelines for the AI

See Advanced Settings for more details.


Need Help? Check out our Troubleshooting Guide or join the Community Discussions.
