VoxAI Orchestrator

 ##     ##  #######  ##     ##    ###    ####
 ##     ## ##     ##  ##   ##    ## ##    ##
 ##     ## ##     ##   ## ##    ##   ##   ##
 ##     ## ##     ##    ###    ##     ##  ##
  ##   ##  ##     ##   ## ##   #########  ##
   ## ##   ##     ##  ##   ##  ##     ##  ##
    ###     #######  ##     ## ##     ## ####

A unified local AI workspace with agentic capabilities

Local LLM Chat • Cloud Providers • Image & Video Generation • Agentic Web Search • Remote Access

Beta software: active development, frequent iteration, and occasional breaking changes are normal.

What Makes VoxAI Different?

VoxAI is not just a model launcher. It is a local-first AI workspace that combines:

local GGUF chat
cloud/provider chat
local diffusion image generation
provider image and video generation
agentic search behavior
a desktop-managed public WebUI

The goal is to keep a powerful desktop app while also exposing a polished browser front end when you choose to run IronGate.

Open-Source Build Notes

This repository is the public OSS project.

The desktop app, WebUI, local model support, provider integrations, and remote access flow are included here.
Private monetization/runtime pieces are not meant to live in the public repo.
Mutable user/account/payment/runtime state should stay in ignored runtime files only.
Use example/template configs and keep your real keys in local private files.

In practical terms, this repo should contain the product and tooling, but not your private billing or cloud-storage business runtime.

Core Features

Chat

Local GGUF chat with llama.cpp/Vulkan-based workflows
Ollama support
Provider chat for Gemini, OpenAI, OpenRouter, DeepSeek, Kimi, Mistral, xAI, and Z.ai
Streaming responses
Code-aware responses and side-panel extraction
Multi-turn conversation context
Agentic search support through the desktop service

Image And Video

Native local image generation without ComfyUI dependency
Local support for SD 1.5, SDXL, Pony, and Flux
Provider image/video generation adapters
Prompt translation / prompt enhancement helpers
WebUI image queue, gallery, and mobile-friendly generation flow

IronGate WebUI

Obsidian-based WebUI layout
Mobile-specific front-end flow
OAuth-capable login flow
Cloudflare-first public tunnel flow with Ngrok fallback
Desktop-controlled public gateway lifecycle
Account/profile surfaces for WebUI users

Quick Start

Prerequisites

Windows is the primary target
Python 3.10+
AMD or NVIDIA GPU recommended for local generation
8GB+ VRAM recommended for the broader local feature set

Fast Start

git clone https://github.com/AlexC1991/AI_GUI.git
cd AI_GUI
start.bat

start.bat handles:

virtual environment setup
dependency installation
runtime directory bootstrapping
launcher startup

Manual Setup

python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
copy config.example.json config.json
python main.py

Local Runtime DLLs

For Vulkan-accelerated local inference, place the required llama.cpp runtime DLLs in the project root:

llama.dll
ggml.dll
ggml-vulkan.dll
ggml-cpu-haswell.dll

Model Setup

Chat Models

Place local .gguf chat models in models/llm/.

Recommended examples:

Qwen3-8B-Q5_K_M.gguf
Llama-3.2-3B-Instruct.gguf
DeepSeek-R1-Distill-8B.gguf

Local Image Models

Typical local image model layout:

AI_GUI/
└── models/
    ├── checkpoints/
    ├── loras/
    ├── vae/
    └── text_encoders/

Supported local image model file types include:

.safetensors
.ckpt
.gguf

Cloud Providers

Provider keys can be configured through the desktop Settings UI or by editing your local config files.

Current chat/provider stack in the project includes:

Gemini
OpenAI
OpenRouter
DeepSeek
Kimi
Mistral
xAI
Z.ai
Ollama

The project also includes media-provider routing for image/video-capable services where configured.

Flux On 8GB VRAM

Current public README state:

Flux Q2 dev / GGUF path: supported for 8GB-class usage
Flux FP8 safetensor path: still partial / half-working

Recommended 8GB-oriented files:

flux1-schnell-Q4_K_S.gguf in models/checkpoints/
t5-v1_1-xxl-encoder-Q4_K_M.gguf in models/text_encoders/
clip_l.safetensors in models/text_encoders/

IronGate Remote Access

IronGate can expose your local app through a public tunnel.

Preferred Setup

Use Cloudflare as the primary public tunnel
Keep Ngrok as the secondary/fallback option
Launch the public gateway from the desktop app
Keep desktop control enabled so the public server only runs while the desktop app is open

Host Config

Copy:

gateway/host_config.example.json -> gateway/host_config.json

Then configure your tunnel and OAuth settings locally.

OAuth Callback Examples

https://your-domain.example/auth/google/callback
https://your-domain.example/auth/github/callback

WebUI Highlights

provider-backed chat, image, and video surfaces
mobile-focused WebUI layout improvements
account/profile panels
local history and user preference persistence
theme/accent support

Repo Hygiene

The repo is intentionally set up to keep local/runtime artifacts out of source control.

Ignored or local-only areas include things like:

model weights and offload blobs
runtime gateway data
secret/config key files
generated outputs
future memory-manager work under VOXAI_Memory_Manager_Safetensor/

Placeholder keep-files preserve directory structure for:

outputs/images/
engine/data/conversations/
engine/data/vectordb/
gateway/exports/
gateway/uploads/
gateway/uploads_desktop/

Project Structure

AI_GUI/
├── backend/              # Chat/image workers and pipeline orchestration
├── engine/               # Local chat engine and memory systems
├── gateway/              # IronGate FastAPI host, templates, static assets
├── ide_core/             # IDE-side helpers and tooling
├── providers/            # Chat, image, and video provider adapters
├── widgets/              # Desktop PySide6 UI
├── config.example.json
├── main.py
├── main_window.py
└── start.bat

Configuration

Main App Config

Copy:

config.example.json -> config.json

Then add the local/provider settings you want to use.

Web Gateway Host Config

Copy:

gateway/host_config.example.json -> gateway/host_config.json

That host config is where local tunnel/OAuth settings belong.

Dependencies

Main dependency groups in the project include:

PySide6 for the desktop UI
diffusers, transformers, and torch for image pipelines
provider SDK/client layers such as OpenAI and Gemini-related packages
fastapi and uvicorn for IronGate
utility packages like psutil, colorama, and related helpers

Install everything with:

pip install -r requirements.txt

Contributing

Contributions, issue reports, and cleanup suggestions are welcome.

License

MIT. See LICENSE.

Built for the local AI community

Giving small models big capabilities

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.idea		.idea
backend		backend
configs		configs
engine		engine
gateway		gateway
ide_core		ide_core
keys		keys
models		models
outputs/images		outputs/images
providers		providers
resources		resources
screenshots		screenshots
tools		tools
utils		utils
widgets		widgets
.gitignore		.gitignore
LICENSE		LICENSE
ReadMe.md		ReadMe.md
bootstrap.py		bootstrap.py
config.example.json		config.example.json
main.py		main.py
main_window.py		main_window.py
requirements.txt		requirements.txt
run_web_server.bat		run_web_server.bat
setup.bat		setup.bat
start.bat		start.bat
web_server.py		web_server.py

Folders and files

Latest commit

History

Repository files navigation

VoxAI Orchestrator

What Makes VoxAI Different?

Open-Source Build Notes

Core Features

Chat

Image And Video

IronGate WebUI

Quick Start

Prerequisites

Fast Start

Manual Setup

Local Runtime DLLs

Model Setup

Chat Models

Local Image Models

Cloud Providers

Flux On 8GB VRAM

IronGate Remote Access

Preferred Setup

Host Config

OAuth Callback Examples

WebUI Highlights

Repo Hygiene

Project Structure

Configuration

Main App Config

Web Gateway Host Config

Dependencies

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages