AI Cost Index

Vendor Detail

Fireworks

Observed model pricing snapshots and freshness signals for this vendor.

Models: 56
Latest Insight: 2026-03-12
Latest Snapshot: 2026-06-03

Quick Take

Use this page as a pricing reference for Fireworks: check the model count, snapshot freshness, and lowest observed prices here, then open the model-level pages for exact comparisons.

  • This page lists 56 observed model entries as of 2026-06-03.
  • The lowest currently observed prices are $0.1000 per 1M input tokens and $0.1000 per 1M output tokens.
  • 9 listed models show recent movement, while 47 remain flat versus the previous snapshot.
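The summary figures above (entry count, lowest prices, moved versus flat) can be recomputed from the listing itself. The following is a minimal sketch, assuming each entry is copied by hand into a small record; the `ModelPrice` class and `summarize` function are hypothetical names, the sample is a four-entry subset of this page rather than the full 56, and treating any trend other than FLAT as "movement" is an assumption.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """One listing entry (hypothetical structure, not the site's schema)."""
    model_id: str
    trend: str            # trend label as shown on the page, e.g. "FLAT" or "UP"
    input_per_1m: float   # USD per 1M input tokens
    output_per_1m: float  # USD per 1M output tokens


# Hand-copied subset of this page's entries, for illustration only.
SAMPLE = [
    ModelPrice("deepseek-coder-1-3b-instruct", "FLAT", 0.10, 0.10),
    ModelPrice("llama-3-1-8b", "UP", 0.20, 0.20),
    ModelPrice("kimi-k2", "FLAT", 0.60, 3.00),
    ModelPrice("deepseek-r1", "UP", 3.00, 8.00),
]


def summarize(entries):
    """Return entry count, lowest prices, and moved/flat counts."""
    trends = Counter(e.trend for e in entries)
    return {
        "models": len(entries),
        "min_input": min(e.input_per_1m for e in entries),
        "min_output": min(e.output_per_1m for e in entries),
        # Assumption: anything not labeled FLAT counts as recent movement.
        "moved": sum(n for trend, n in trends.items() if trend != "FLAT"),
        "flat": trends["FLAT"],
    }


summary = summarize(SAMPLE)
print(summary)
```

Run against all 56 entries instead of the subset, the same function should reproduce the counts stated in the bullets, provided the entries are transcribed faithfully.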

The Evidence page shows the most recent verification data for this vendor. Confirm final billing terms against the official pricing page and the vendor's own offering conditions.

Model | ID | Trend | Input / 1M | Output / 1M | Updated
Deepseek Coder 1.3B Instruct | deepseek-coder-1-3b-instruct | FLAT | $0.1000 | $0.1000 | 2026-03-10
Gemma 2B Instruct | gemma-2b-instruct | FLAT | $0.1000 | $0.1000 | 2026-03-10
Starcoder 2 3B | starcoder-2-3b | FLAT | $0.1000 | $0.1000 | 2026-03-10
Qwen 1.5 0.5B Chat | qwen-1-5-0-5b-chat | FLAT | $0.1500 | $0.1500 | 2026-03-10
Qwen 1.5 1.8B Chat | qwen-1-5-1-8b-chat | FLAT | $0.1500 | $0.1500 | 2026-03-10
Qwen 1.5 4B Chat | qwen-1-5-4b-chat | FLAT | $0.1500 | $0.1500 | 2026-03-10
Code Llama 7B Instruct | codellama-2-7b-instruct | FLAT | $0.2000 | $0.2000 | 2026-03-10
Deepseek Coder 6.7B Instruct | deepseek-coder-6-7b-instruct | FLAT | $0.2000 | $0.2000 | 2026-03-10
Firefunction V2 | firefunction-v2 | FLAT | $0.2000 | $0.2000 | 2026-03-10
Gemma 2 9B IT | gemma-2-9b-it | FLAT | $0.2000 | $0.2000 | 2026-03-10
Gemma 7B Instruct | gemma-7b-instruct | FLAT | $0.2000 | $0.2000 | 2026-03-10
Gemma 7B IT | gemma-7b-it | FLAT | $0.2000 | $0.2000 | 2026-03-10
Hermes 2 Pro Llama 3 8B | hermes-2-pro-llama-3-8b | FLAT | $0.2000 | $0.2000 | 2026-03-10
Hermes 2 Pro Mistral 7B | hermes-2-pro-mistral-7b | FLAT | $0.2000 | $0.2000 | 2026-03-10
Llama 2 7B Chat | llama-2-7b-chat | FLAT | $0.2000 | $0.2000 | 2026-03-10
Llama 3 8B Instruct | llama-3-8b-instruct | FLAT | $0.2000 | $0.2000 | 2026-03-10
Mistral 7B Instruct v0.2 | mistral-7b-instruct-v0-2 | FLAT | $0.2000 | $0.2000 | 2026-03-10
Qwen 1.5 7B Chat | qwen-1-5-7b-chat | FLAT | $0.2000 | $0.2000 | 2026-03-10
Starcoder 2 7B | starcoder-2-7b | FLAT | $0.2000 | $0.2000 | 2026-03-10
Llama 3.1 8B | llama-3-1-8b | UP | $0.2000 | $0.2000 | 2026-03-10
Mistral Nemo | mistral-nemo | UP | $0.2000 | $0.2000 | 2026-03-10
Llama 3.1 8B Instruct | llama-3-1-8b-instruct | FLAT | $0.2500 | $0.2500 | 2026-03-10
Code Llama 13B Instruct | codellama-2-13b-instruct | FLAT | $0.3000 | $0.3000 | 2026-03-10
Llama 2 13B Chat | llama-2-13b-chat | FLAT | $0.3000 | $0.3000 | 2026-03-10
Qwen 1.5 14B Chat | qwen-1-5-14b-chat | FLAT | $0.3000 | $0.3000 | 2026-03-10
Starcoder 2 15B | starcoder-2-15b | FLAT | $0.3000 | $0.3000 | 2026-03-10
StarCoder2 15B | starcoder2-15b | FLAT | $0.3000 | $0.3000 | 2026-03-10
MiniMax M2 | minimax-m2 | FLAT | $0.3000 | $1.2000 | 2026-03-12
Firefunction v1 | firefunction-v1 | FLAT | $0.5000 | $0.5000 | 2026-03-10
Mixtral 8x7B Instruct | mixtral-8x7b-instruct | FLAT | $0.5000 | $0.5000 | 2026-03-10
Mixtral 8x7B | mixtral-8x7b | UP | $0.5000 | $0.5000 | 2026-03-10
DBRX Instruct | dbrx-instruct | FLAT | $0.6000 | $0.6000 | 2026-03-10
GLM 5 | glm-5 | FLAT | $0.6000 | $0.6000 | 2026-03-10
Kimi K2 | kimi-k2 | FLAT | $0.6000 | $3.0000 | 2026-03-12
Code Llama 34B Instruct | codellama-2-34b-instruct | FLAT | $0.8000 | $0.8000 | 2026-03-10
Deepseek Coder 33B Instruct | deepseek-coder-33b-instruct | FLAT | $0.8000 | $0.8000 | 2026-03-10
Phind Code Llama v2 | phind-codellama-34b-v2 | FLAT | $0.8000 | $0.8000 | 2026-03-10
Qwen 1.5 32B Chat | qwen-1-5-32b-chat | FLAT | $0.8000 | $0.8000 | 2026-03-10
WizardCoder Python 34B v1.0 | wizardcoder-python-34b-v1-0 | FLAT | $0.8000 | $0.8000 | 2026-03-10
Yi 34B 200K Instruct | yi-34b-200k-instruct | FLAT | $0.8000 | $0.8000 | 2026-03-10
Yi 34B Chat | yi-34b-chat | FLAT | $0.8000 | $0.8000 | 2026-03-10
Code Llama 70B Instruct | codellama-2-70b-instruct | FLAT | $0.9000 | $0.9000 | 2026-03-10
CodeLlama 70B Instruct | codellama-70b-instruct | FLAT | $0.9000 | $0.9000 | 2026-03-10
Llama 2 70B Chat | llama-2-70b-chat | FLAT | $0.9000 | $0.9000 | 2026-03-10
Llama 3 70B Instruct | llama-3-70b-instruct | FLAT | $0.9000 | $0.9000 | 2026-03-10
Qwen 1.5 72B Chat | qwen-1-5-72b-chat | FLAT | $0.9000 | $0.9000 | 2026-03-10
Qwen 72B Chat | qwen-72b-chat | FLAT | $0.9000 | $0.9000 | 2026-03-10
DeepSeek V3 | deepseek-v3 | UP | $0.9000 | $0.9000 | 2026-03-10
Llama 3.3 70B | llama-3-3-70b | UP | $0.9000 | $0.9000 | 2026-03-10
Mixtral 8x22B Instruct | mixtral-8x22b-instruct | FLAT | $1.2000 | $1.2000 | 2026-03-10
DeepSeek Coder | deepseek-coder | UP | $1.2000 | $1.2000 | 2026-03-10
Mistral Large | mistral-large | UP | $1.2000 | $1.2000 | 2026-03-10
Mixtral 8x22B | mixtral-8x22b | UP | $1.2000 | $1.2000 | 2026-03-10
Llama 3.1 70B Instruct | llama-3-1-70b-instruct | FLAT | $1.7500 | $1.7500 | 2026-03-10
DeepSeek R1 | deepseek-r1 | UP | $3.0000 | $8.0000 | 2026-03-10
Llama 3.1 405B Instruct | llama-3-1-405b-instruct | FLAT | $4.0000 | $4.0000 | 2026-03-10
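
Because every price in the listing is quoted per 1M tokens, a workload estimate is the token count divided by 1,000,000, multiplied by the rate, computed separately for input and output. The sketch below illustrates that arithmetic using the deepseek-r1 rates listed above; the `estimate_cost` function and the token volumes are hypothetical, and the result reflects observed list prices only, not final billing terms.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices on this page are per 1M tokens


def estimate_cost(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """Estimate USD cost from token counts and per-1M-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * Decimal(input_per_1m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * Decimal(output_per_1m)
    return input_cost + output_cost


# deepseek-r1 as listed: $3.0000 input, $8.0000 output per 1M tokens.
# The token volumes are made-up example numbers.
cost = estimate_cost(2_000_000, 500_000, "3.0000", "8.0000")
print(f"Estimated cost: ${cost:.2f}")
```

`Decimal` is used instead of floats so that fractional-cent prices such as $0.1500 do not pick up binary rounding error when summed across many requests.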

