Skip to content

chore(pricing): Update fireworks-ai pricing#549

Open
siddharthsambharia-portkey wants to merge 60 commits intomainfrom
pricing-update/fireworks-ai
Open

chore(pricing): Update fireworks-ai pricing#549
siddharthsambharia-portkey wants to merge 60 commits intomainfrom
pricing-update/fireworks-ai

Conversation

@siddharthsambharia-portkey
Copy link
Copy Markdown
Collaborator

@siddharthsambharia-portkey siddharthsambharia-portkey commented Mar 17, 2026

🔄 Pricing Update: fireworks-ai

📊 Summary (complete_diff mode)

Change Type Count
➕ Models added 17
🔄 Models updated (merged) 4

➕ New Models

  • deepseek-v3p1
  • deepseek-v3p2
  • glm-4p7
  • glm-5
  • qwen3-vl-30b-a3b-instruct
  • qwen3-vl-30b-a3b-thinking
  • gpt-oss-120b
  • gpt-oss-20b
  • minimax-m2p1
  • minimax-m2p5
  • llama-v3p3-70b-instruct
  • qwen3-8b
  • flux-1-dev-fp8
  • flux-1-schnell-fp8
  • flux-kontext-pro
  • flux-kontext-max
  • qwen3-embedding-8b

🔄 Updated Models

  • kimi-k2-instruct-0905
  • kimi-k2-thinking
  • kimi-k2p5
  • mixtral-8x22b-instruct

Model → Pricing Category Mapping

Named Families (exact page values)

Model ID Pricing Row Input Output Cache Read
deepseek-v3p1, deepseek-v3p2 DeepSeek V3 family $0.56 $1.68 $0.28 (50%)
glm-4p7 GLM-4.7 $0.60 $2.20 $0.30 (50%)
glm-5 GLM-5 $1.00 $3.20 $0.20 (specified)
qwen3-vl-30b-a3b-instruct, qwen3-vl-30b-a3b-thinking Qwen3 VL 30B A3B $0.15 $0.60 $0.075 (50%)
kimi-k2-instruct-0905, kimi-k2-thinking Kimi K2 Instruct/Thinking $0.60 $2.50 $0.30 (50%)
kimi-k2p5 Kimi K2.5 $0.60 $3.00 $0.10 (specified)
gpt-oss-120b OpenAI gpt-oss-120b $0.15 $0.60 $0.075 (50%)
gpt-oss-20b OpenAI gpt-oss-20b $0.07 $0.30 $0.035 (50%)
minimax-m2p1, minimax-m2p5 MiniMax M2 family $0.30 $1.20 $0.03 (specified)

Tier-Based Text Models

Model ID Tier Input Output
llama-v3p3-70b-instruct >16B $0.90 $0.90
mixtral-8x22b-instruct MoE 56.1-176B $1.20 $1.20
qwen3-8b 4B-16B $0.20 $0.20

Image Models

Model ID Type Pricing
flux-1-dev-fp8 Per-step $0.0005/step
flux-1-schnell-fp8 Per-step $0.00035/step
flux-kontext-pro Per-image $0.04/image
flux-kontext-max Per-image $0.08/image

Embedding Models

Model ID Input
qwen3-embedding-8b $0.10/1M tokens

Skipped

  • qwen3-reranker-8b — Reranker model (excluded per skill rules)

Generated by Pricing Agent on 2026-04-03

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant