Llama 3.1 8B
llama-3-1-8b
Input / 1M
$0.0200
Output / 1M
$0.0300
Vendor Detail
Observed model pricing snapshots and freshness signals for this vendor.
Why Trust This Page
DeepInfra is presented here as a vendor pricing reference page first: review model count, freshness, and the current lowest observed prices, then move into model-level pages for exact comparisons.
The Evidence page provides the latest verification surface for this vendor. Final billing terms still need to be confirmed against the official pricing page and the vendor-specific offering conditions.
llama-3-1-8b
Input / 1M
$0.0200
Output / 1M
$0.0300
llama-3-2-3b
Input / 1M
$0.0200
Output / 1M
$0.0200
mistral-nemo
Input / 1M
$0.0200
Output / 1M
$0.0400
llama-3-8b
Input / 1M
$0.0300
Output / 1M
$0.0600
gemma-3-4b
Input / 1M
$0.0400
Output / 1M
$0.0800
gpt-oss-20b
Input / 1M
$0.0400
Output / 1M
$0.1500
lunaris-8b
Input / 1M
$0.0400
Output / 1M
$0.0500
nvidia-nemotron-nano-9b-v2
Input / 1M
$0.0400
Output / 1M
$0.1600
qwen-2-5-7b
Input / 1M
$0.0400
Output / 1M
$0.1000
llama-3-2-11b-vision
Input / 1M
$0.0490
Output / 1M
$0.0490
gemma-3-12b
Input / 1M
$0.0500
Output / 1M
$0.1000
gpt-oss-120b
Input / 1M
$0.0500
Output / 1M
$0.4500
mistral-small
Input / 1M
$0.0500
Output / 1M
$0.0800
llama-guard-3-8b
Input / 1M
$0.0550
Output / 1M
$0.0550
qwen-3-14b
Input / 1M
$0.0600
Output / 1M
$0.2400
phi-4
Input / 1M
$0.0700
Output / 1M
$0.1400
llama-4-scout
Input / 1M
$0.0800
Output / 1M
$0.3000
mythomax-l2-13b
Input / 1M
$0.0800
Output / 1M
$0.0900
qwen-3-30b-a3b
Input / 1M
$0.0800
Output / 1M
$0.2900
gemma-3-27b
Input / 1M
$0.0900
Output / 1M
$0.1600
gemini-2-0-flash
Input / 1M
$0.1000
Output / 1M
$0.4000
qwen-3-32b
Input / 1M
$0.1000
Output / 1M
$0.2800
qwen-2-5-72b
Input / 1M
$0.1200
Output / 1M
$0.3900
qwen-3-next-80b-a3b
Input / 1M
$0.1400
Output / 1M
$1.4000
llama-4-maverick
Input / 1M
$0.1500
Output / 1M
$0.6000
qwq-32b
Input / 1M
$0.1500
Output / 1M
$0.4000
llama-guard-4-12b
Input / 1M
$0.1800
Output / 1M
$0.1800
qwen-3-235b-a22b
Input / 1M
$0.1800
Output / 1M
$0.5400
qwen-2-5-32b
Input / 1M
$0.2000
Output / 1M
$0.6000
llama-3-3-70b
Input / 1M
$0.2300
Output / 1M
$0.4000
olmocr-7b
Input / 1M
$0.2700
Output / 1M
$1.5000
gemini-2-5-flash
Input / 1M
$0.3000
Output / 1M
$2.5000
deepseek-v3
Input / 1M
$0.3800
Output / 1M
$0.8900
glm-4-5
Input / 1M
$0.4000
Output / 1M
$1.6000
mixtral-8x7b
Input / 1M
$0.4000
Output / 1M
$0.4000
qwen-3-coder-480b-a35b
Input / 1M
$0.4000
Output / 1M
$1.6000
wizardlm-2-8x22b
Input / 1M
$0.4800
Output / 1M
$0.4800
kimi-k2-instruct
Input / 1M
$0.5000
Output / 1M
$2.0000
llama-3-1-70b
Input / 1M
$0.6000
Output / 1M
$0.6000
deepseek-r1
Input / 1M
$0.7000
Output / 1M
$2.4000
llama-3-1-405b
Input / 1M
$1.0000
Output / 1M
$1.0000
gemini-2-5-pro
Input / 1M
$1.2500
Output / 1M
$10.0000
claude-3-7-sonnet
Input / 1M
$3.3000
Output / 1M
$16.5000
claude-4-sonnet
Input / 1M
$3.3000
Output / 1M
$16.5000
claude-4-opus
Input / 1M
$16.5000
Output / 1M
$82.5000
Jump straight into compare pages built from the models visible on this vendor page.
Premium Report
Get cross-vendor cost intelligence, including hidden token/API costs,
plus full-period change history in CSV format.
Instant download