fal (Features and Labels, Inc.) is a generative media platform that offers what it describes as the world's fastest API for running image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at https://queue.fal.run, plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models (including FLUX, Veo 3, Kling, Wan, Seedream, Nano Banana, and Stable Diffusion) on autoscaling GPU infrastructure. fal Serverless lets developers ship custom models with @fal.function, fal.App, or bring-your-own (BYO) containers, while fal Compute provides dedicated H100, H200, A100, and B200 instances. The company lists Canva, Perplexity, and Poe among its customers and reports 1.5M+ developers, a $140M Sequoia-led Series D (December 2025), SOC 2 compliance, and 99.99% uptime.
APIs.json: https://raw.githubusercontent.com/api-evangelist/fal-ai/refs/heads/main/apis.yml
- Position: Consuming
- Access: 3rd-Party
- AI
- Artificial Intelligence
- Generative AI
- Generative Media
- Image Generation
- Video Generation
- Audio Generation
- Inference
- Serverless
- GPU
- MCP
- Created: 2026-05-25
- Modified: 2026-05-25
Unified queue-based REST API for invoking 1,000+ generative image, video, audio, and multimodal models hosted on fal's inference infrastructure. Submit a request to https://queue.fal.run/{model-id}, poll /requests/{request_id}/status for progress, then fetch /requests/{request_id} for the result, or subscribe to webhook callbacks instead of polling. Supports synchronous responses, asynchronous queueing, server-sent streaming progress, and request cancellation. Powers flagship models including FLUX, Veo 3, Kling 2.5, Wan 2.5, Seedream, Nano Banana, Qwen, SDXL, and Stable Diffusion variants.
- Human URL: https://fal.ai/docs/model-apis/quickstart
- Base URL: https://queue.fal.run
- AI
- Generative AI
- Image Generation
- Video Generation
- Audio Generation
- Multimodal
- Inference
- Documentation
- OpenAPI — OpenAPI Specification
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
- JSON Schema — JSON Schema
- JSON-LD — JSON-LD
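The submit, poll, and fetch flow described for the Model API can be sketched roughly as follows, using only the Python standard library. This is a minimal sketch under stated assumptions, not fal's official client: it assumes the /requests/... paths are nested under the model id, that the submit response carries a `request_id` field, and that the status response carries a `status` field whose terminal value is `COMPLETED`. The model id `example-owner/example-model` is a placeholder.

```python
import json
import time
import urllib.request

QUEUE_BASE = "https://queue.fal.run"  # base URL from the listing above


def submit_url(model_id: str) -> str:
    return f"{QUEUE_BASE}/{model_id}"


def status_url(model_id: str, request_id: str) -> str:
    # Assumption: request paths are nested under the model id.
    return f"{QUEUE_BASE}/{model_id}/requests/{request_id}/status"


def result_url(model_id: str, request_id: str) -> str:
    return f"{QUEUE_BASE}/{model_id}/requests/{request_id}"


def http_json(method, url, api_key, payload=None):
    """Send one JSON request with the Authorization: Key header and decode the reply."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    req = urllib.request.Request(
        url,
        data=data,
        method=method,
        headers={"Authorization": f"Key {api_key}", "Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def run_queued(model_id, payload, api_key, send=http_json,
               poll_seconds=1.0, max_polls=300, sleep=time.sleep):
    """Submit a request, poll its status, and return the final result."""
    submitted = send("POST", submit_url(model_id), api_key, payload)
    request_id = submitted["request_id"]  # assumed response field
    for _ in range(max_polls):
        status = send("GET", status_url(model_id, request_id), api_key)
        if status.get("status") == "COMPLETED":  # assumed terminal value
            return send("GET", result_url(model_id, request_id), api_key)
        sleep(poll_seconds)
    raise TimeoutError(f"request {request_id} did not complete after {max_polls} polls")
```

The `send` and `sleep` parameters are injectable so the polling logic can be exercised without network access; in real use the defaults would apply, for example `run_queued("example-owner/example-model", {"prompt": "a red bicycle"}, api_key)`.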
WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as LCM/SDXL sketch-to-image, live-portrait, and realtime upscaling. Bi-directional binary/JSON messaging keeps a persistent connection open so each frame, prompt, or pose adjustment is processed in milliseconds. Powers fal.realtime client utilities used in canvas apps, drawing tools, AR experiences, and live video pipelines.
- Human URL: https://fal.ai/docs/model-apis/real-time
- Base URL: wss://realtime.fal.run
- AI
- Generative AI
- Realtime
- WebSocket
- Streaming
- Inference
- Documentation
- Code Examples
- AsyncAPI — AsyncAPI Specification
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
HTTP streaming endpoint (/{model-id}/stream) that emits progressive partial outputs as a model runs — used for LLM/VLM token streams, incremental video frames, and step-by-step image diffusion previews. Compatible with Server-Sent Events parsers in the official fal-client SDKs.
- Human URL: https://fal.ai/docs/model-apis/streaming
- Base URL: https://queue.fal.run
- AI
- Generative AI
- Streaming
- Server-Sent Events
- Inference
- Documentation
- AsyncAPI — AsyncAPI Specification
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
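To illustrate how a Server-Sent Events stream from the /{model-id}/stream endpoint might be consumed without an SDK, here is a small sketch of an SSE `data:` parser plus a thin wrapper around it. It is illustrative only: the use of POST with a JSON body, and the assumption that each event payload is JSON, are guesses that are not confirmed by the listing above.

```python
import json
import urllib.request
from typing import Iterable, Iterator

QUEUE_BASE = "https://queue.fal.run"  # base URL from the listing above


def stream_url(model_id: str) -> str:
    return f"{QUEUE_BASE}/{model_id}/stream"


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each complete SSE event from decoded text lines."""
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
            continue
        if line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]  # a single leading space is not part of the value
        if field == "data":
            buffer.append(value)
    # An event not terminated by a blank line is discarded, as in the SSE spec.


def stream_events(model_id: str, payload: dict, api_key: str) -> Iterator[dict]:
    """Open the stream and yield each event decoded as JSON (assumed format)."""
    req = urllib.request.Request(
        stream_url(model_id),
        data=json.dumps(payload).encode("utf-8"),
        method="POST",  # assumption: the stream is started with a POST
        headers={"Authorization": f"Key {api_key}", "Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        for data in iter_sse_data(line.decode("utf-8") for line in resp):
            yield json.loads(data)
```

Keeping the parser separate from the HTTP call means it can be fed lines from any source, including a recorded stream used in tests.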
REST endpoints for uploading binary inputs (images, audio clips, reference frames, control maps) to fal's CDN so they can be referenced by URL when invoking model APIs. Issues short-lived signed upload URLs via /storage/upload/initiate and serves the resulting assets from https://v3.fal.media.
- Human URL: https://fal.ai/docs/model-apis/file-uploads
- Base URL: https://rest.alpha.fal.ai
- AI
- Generative AI
- File Upload
- Storage
- CDN
- Documentation
- OpenAPI — OpenAPI Specification
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
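The two-step upload flow described for the Storage API (request a signed URL, then send the bytes to it) could look roughly like the sketch below. The request body fields `file_name` and `content_type`, the response fields `upload_url` and `file_url`, and the use of PUT for the upload step are all assumptions made for illustration; only the /storage/upload/initiate path and the base URL come from the listing above.

```python
import json
import urllib.request

REST_BASE = "https://rest.alpha.fal.ai"  # base URL from the listing above


def build_initiate_request(api_key: str, file_name: str, content_type: str):
    """Build the request that asks for a short-lived signed upload URL."""
    body = json.dumps({
        "file_name": file_name,        # assumed field name
        "content_type": content_type,  # assumed field name
    }).encode("utf-8")
    return urllib.request.Request(
        f"{REST_BASE}/storage/upload/initiate",
        data=body,
        method="POST",
        headers={"Authorization": f"Key {api_key}", "Content-Type": "application/json"},
    )


def build_upload_request(upload_url: str, data: bytes, content_type: str):
    """Build the request that sends the raw bytes to the signed URL (PUT is assumed)."""
    return urllib.request.Request(
        upload_url, data=data, method="PUT", headers={"Content-Type": content_type}
    )


def upload_file(api_key, file_name, data, content_type, open_url=urllib.request.urlopen):
    """Run both steps and return the CDN URL to pass to a model as input."""
    with open_url(build_initiate_request(api_key, file_name, content_type)) as resp:
        signed = json.load(resp)
    with open_url(build_upload_request(signed["upload_url"], data, content_type)):
        pass  # the response body is not needed
    return signed["file_url"]  # assumed response field
```

The returned URL would then be placed in a model request payload wherever the model expects an image, audio, or reference file URL.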
Programmatic management of custom fal Serverless applications — list, inspect, deploy, scale, and monitor user-defined GPU functions deployed with @fal.function, fal.App, or BYO containers. Covers app metadata, secrets, file volumes, scaling parameters (keep_alive, min_concurrency), and execution analytics.
- Human URL: https://fal.ai/docs/private-serverless-models
- Base URL: https://rest.alpha.fal.ai
- AI
- Serverless
- GPU
- Deployments
- Platform
- Documentation
- SDK
- OpenAPI — OpenAPI Specification
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model metadata, capability tags, pricing per output, supported parameters, example inputs, and OpenAPI schemas per model. Backs the model gallery, search, and SDK tooling.
- Human URL: https://fal.ai/models
- Base URL: https://fal.ai
- AI
- Generative AI
- Catalog
- Discovery
- Documentation
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for training, fine-tuning, and persistent workloads. Hourly or per-second billing with no lock-in.
- Human URL: https://fal.ai/compute
- Base URL: https://rest.alpha.fal.ai
- AI
- GPU
- Compute
- Infrastructure
- Dedicated
- Documentation
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model, Storage, Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.
- Human URL: https://fal.ai/dashboard/keys
- Base URL: https://rest.alpha.fal.ai
- AI
- Administration
- Authentication
- API Keys
- Documentation
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
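As a small illustration of the Authorization: Key $FAL_KEY header mentioned above, the helper below reads the key from the environment and builds the header. The function name `fal_auth_header` is hypothetical, and failing loudly on a missing key is a design choice for this sketch rather than documented fal behavior.

```python
import os


def fal_auth_header(env=None) -> dict:
    """Return the Authorization header built from the FAL_KEY environment variable."""
    env = os.environ if env is None else env
    key = env.get("FAL_KEY", "").strip()
    if not key:
        # Fail early instead of sending an unauthenticated request.
        raise RuntimeError("FAL_KEY is not set")
    return {"Authorization": f"Key {key}"}
```

Accepting an optional mapping makes the helper easy to test and keeps the key out of source code.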
Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing history. Surfaces the same data shown on the fal dashboard so platform teams can pipe inference cost into internal FinOps tooling.
- Human URL: https://fal.ai/dashboard/usage
- Base URL: https://rest.alpha.fal.ai
- AI
- Administration
- Usage
- Billing
- FinOps
- Documentation
- Postman Collection — Postman Collection 2.1
- Open Collection — Open Collection 1.0
- Arazzo Workflows — Arazzo Specification
- Portal
- Documentation
- Getting Started
- Status Page
- Blog
- Sign Up
- Pricing
- Support
- Forum
- Terms of Service
- Privacy Policy
- Trust Center
- GitHub Organization
- SDK
- Tool
- Code Examples
- Plans
- Rate Limits
- FinOps
- Features
- FN: Kin Lane
- Email: info@apievangelist.com
- URL: https://apievangelist.com