Skip to content

api-evangelist/fal-ai

Repository files navigation

fal (fal-ai)

fal (Features and Labels, Inc.) is a generative media platform providing the world's fastest API for running image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at https://queue.fal.run, plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models — including FLUX, Veo 3, Kling, Wan, Seedream, Nano Banana, and Stable Diffusion — on autoscaling GPU infrastructure. fal Serverless lets developers ship custom models with @fal.function / fal.App / BYO containers, while fal Compute provides dedicated H100/H200/A100/B200 instances. Trusted by Canva, Perplexity, Poe, and 1.5M+ developers; Series D funded ($140M, Sequoia-led, December 2025); SOC 2 with 99.99% uptime.

APIs.json: https://raw.githubusercontent.com/api-evangelist/fal-ai/refs/heads/main/apis.yml

Scope

  • Position: Consuming
  • Access: 3rd-Party

Tags

  • AI
  • Artificial Intelligence
  • Generative AI
  • Generative Media
  • Image Generation
  • Video Generation
  • Audio Generation
  • Inference
  • Serverless
  • GPU
  • MCP

Timestamps

  • Created: 2026-05-25
  • Modified: 2026-05-25

APIs

fal Model APIs

Unified queue-based REST API for invoking 1,000+ generative image, video, audio, and multimodal models hosted on fal's inference infrastructure. Submit a request to https://queue.fal.run/{model-id}, poll /requests/{request_id}/status or /requests/{request_id} for progress and results, or subscribe to webhook callbacks. Supports synchronous responses, asynchronous queueing, server-sent streaming progress, and request cancellation. Powers flagship models including FLUX, Veo 3, Kling 2.5, Wan 2.5, Seedream, Nano Banana, Qwen, SDXL, and Stable Diffusion variants.

Tags

  • AI
  • Generative AI
  • Image Generation
  • Video Generation
  • Audio Generation
  • Multimodal
  • Inference

Properties

fal Realtime API

WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as LCM/SDXL sketch-to-image, live-portrait, and realtime upscaling. Bi-directional binary/JSON messaging keeps a persistent connection open so each frame, prompt, or pose adjustment is processed in milliseconds. Powers fal.realtime client utilities used in canvas apps, drawing tools, AR experiences, and live video pipelines.

Tags

  • AI
  • Generative AI
  • Realtime
  • WebSocket
  • Streaming
  • Inference

Properties

fal Streaming API

HTTP streaming endpoint (/{model-id}/stream) that emits progressive partial outputs as a model runs — used for LLM/VLM token streams, incremental video frames, and step-by-step image diffusion previews. Compatible with Server-Sent Events parsers in the official fal-client SDKs.

Tags

  • AI
  • Generative AI
  • Streaming
  • Server-Sent Events
  • Inference

Properties

fal Storage API

REST endpoints for uploading binary inputs (images, audio clips, reference frames, control maps) to fal's CDN so they can be referenced by URL when invoking model APIs. Issues short-lived signed upload URLs via /storage/upload/initiate and serves the resulting assets from https://v3.fal.media.

Tags

  • AI
  • Generative AI
  • File Upload
  • Storage
  • CDN

Properties

fal Serverless Platform API

Programmatic management of custom fal Serverless applications — list, inspect, deploy, scale, and monitor user-defined GPU functions deployed with @fal.function, fal.App, or BYO containers. Covers app metadata, secrets, file volumes, scaling parameters (keep_alive, min_concurrency), and execution analytics.

Tags

  • AI
  • Serverless
  • GPU
  • Deployments
  • Platform

Properties

fal Models Catalog API

Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model metadata, capability tags, pricing per output, supported parameters, example inputs, and OpenAPI schemas per model. Backs the model gallery, search, and SDK tooling.

Tags

  • AI
  • Generative AI
  • Catalog
  • Discovery

Properties

fal Compute API

Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for training, fine-tuning, and persistent workloads. Hourly or per-second billing with no lock-in.

Tags

  • AI
  • GPU
  • Compute
  • Infrastructure
  • Dedicated

Properties

fal API Keys API

Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model, Storage, Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.

Tags

  • AI
  • Administration
  • Authentication
  • API Keys

Properties

fal Usage and Billing API

Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing history. Surfaces the same data shown on the fal dashboard so platform teams can pipe inference cost into internal FinOps tooling.

Tags

  • AI
  • Administration
  • Usage
  • Billing
  • FinOps

Properties

Common Properties

Maintainers

FN: Kin Lane Email: info@apievangelist.com URL: https://apievangelist.com

Releases

No releases published

Packages

 
 
 

Contributors