v1.1.0 — Multi-Modal: Image Generation

@stackbilt-admin stackbilt-admin released this 01 Apr 15:54

Image Generation Provider

@stackbilt/llm-providers is now multi-modal: text and image inference in one package.

New: ImageProvider

import { ImageProvider } from '@stackbilt/llm-providers';

const img = new ImageProvider({
  cloudflareAi: env.AI,
  geminiApiKey: env.GEMINI_API_KEY,
});

const result = await img.generateImage({
  prompt: 'a mountain landscape at sunset',
  model: 'flux-dev',
});
// result.image is an ArrayBuffer; result.responseTime and result.provider are also set
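
Since `result.image` is a raw `ArrayBuffer` and these notes do not say which image format each model emits, a caller may want to detect the format before serving the bytes. The following is a minimal sketch of that idea; `sniffImageMime` is a hypothetical helper, not part of `@stackbilt/llm-providers`, and it only recognizes PNG, JPEG, and WebP signatures.

```typescript
// Hypothetical helper (not part of @stackbilt/llm-providers).
// Guesses a MIME type from the leading "magic" bytes of an image buffer.
function sniffImageMime(image: ArrayBuffer): string {
  const bytes = new Uint8Array(image);

  // True when `sig` appears in the buffer starting at `offset`.
  const hasSignature = (sig: number[], offset = 0): boolean =>
    bytes.length >= offset + sig.length &&
    sig.every((value, i) => bytes[offset + i] === value);

  // PNG: 8-byte fixed signature.
  if (hasSignature([0x89, 0x50, 0x4e, 0x47, 0x0d, 0x0a, 0x1a, 0x0a])) {
    return 'image/png';
  }
  // JPEG: starts with FF D8 FF.
  if (hasSignature([0xff, 0xd8, 0xff])) {
    return 'image/jpeg';
  }
  // WebP: "RIFF" at offset 0 and "WEBP" at offset 8.
  if (
    hasSignature([0x52, 0x49, 0x46, 0x46]) &&
    hasSignature([0x57, 0x45, 0x42, 0x50], 8)
  ) {
    return 'image/webp';
  }
  // Unknown format: fall back to a generic binary type.
  return 'application/octet-stream';
}
```

In a Worker, the detected type could then be used as the `Content-Type` header when returning `result.image` in a `Response`.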

Built-in Models

Model                        Provider     Use Case
sdxl-lightning               Cloudflare   Fast drafts, free tier
flux-klein                   Cloudflare   Balanced quality/speed
flux-dev                     Cloudflare   Highest CF quality
gemini-flash-image           Google       Text rendering capable
gemini-flash-image-preview   Google       Latest preview model
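
The model table can also be written down as a typed lookup, so that application code picks a model by intent rather than by hard-coded string. This is only a sketch: the model names and use cases are copied from the table, while `BUILT_IN_MODELS`, `modelsFor`, and `pickModel` (including its selection policy) are hypothetical and not exported by the package.

```typescript
type Provider = 'cloudflare' | 'google';

type ModelName =
  | 'sdxl-lightning'
  | 'flux-klein'
  | 'flux-dev'
  | 'gemini-flash-image'
  | 'gemini-flash-image-preview';

interface ModelInfo {
  provider: Provider;
  useCase: string;
}

// Hypothetical catalog, transcribed from the Built-in Models table.
const BUILT_IN_MODELS: Record<ModelName, ModelInfo> = {
  'sdxl-lightning': { provider: 'cloudflare', useCase: 'Fast drafts, free tier' },
  'flux-klein': { provider: 'cloudflare', useCase: 'Balanced quality/speed' },
  'flux-dev': { provider: 'cloudflare', useCase: 'Highest CF quality' },
  'gemini-flash-image': { provider: 'google', useCase: 'Text rendering capable' },
  'gemini-flash-image-preview': { provider: 'google', useCase: 'Latest preview model' },
};

// Lists the model names served by one provider, in table order.
function modelsFor(provider: Provider): ModelName[] {
  return (Object.keys(BUILT_IN_MODELS) as ModelName[]).filter(
    (name) => BUILT_IN_MODELS[name].provider === provider,
  );
}

// Illustrative policy only: text rendering wins, then draft speed,
// otherwise the highest-quality Cloudflare model.
function pickModel(opts: { needsText?: boolean; draft?: boolean }): ModelName {
  if (opts.needsText) return 'gemini-flash-image';
  if (opts.draft) return 'sdxl-lightning';
  return 'flux-dev';
}
```

The returned name can be passed as the `model` field of `generateImage`, as in the example earlier in these notes.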

Extracted from the img-forge production codebase. Its response normalization, already exercised in production, handles all Workers AI return formats.
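
To illustrate what response normalization means here, the sketch below collapses several possible output shapes into a single `ArrayBuffer`. It is not the package's implementation: the three shapes handled (a raw `ArrayBuffer`, a `Uint8Array`, and an object carrying a base64 string in `image`) are assumptions made for this example, and streamed output is left out to keep the function synchronous. `normalizeImageOutput` is a hypothetical name.

```typescript
// Output shapes assumed for this sketch; the real package may handle more.
type RawImageOutput = ArrayBuffer | Uint8Array | { image: string };

// Hypothetical helper: converts any of the assumed shapes to an ArrayBuffer.
function normalizeImageOutput(raw: RawImageOutput): ArrayBuffer {
  if (raw instanceof Uint8Array) {
    // Copy so the result holds exactly the view's bytes, even for subarrays.
    const copy = new Uint8Array(raw.byteLength);
    copy.set(raw);
    return copy.buffer;
  }
  if (raw instanceof ArrayBuffer) {
    // Already in the target shape.
    return raw;
  }
  if (typeof raw.image === 'string') {
    // Strip an optional data-URL prefix, then decode the base64 payload.
    const base64 = raw.image.replace(/^data:[^,]*,/, '');
    const binary = atob(base64);
    const bytes = new Uint8Array(binary.length);
    for (let i = 0; i < binary.length; i++) {
      bytes[i] = binary.charCodeAt(i);
    }
    return bytes.buffer;
  }
  throw new TypeError('Unrecognized image output shape');
}
```

Funnelling every shape through one function means downstream code only ever deals with an `ArrayBuffer`, which matches the `result.image` type shown in the usage example.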

Full changelog: CHANGELOG.md