Llama 3.1 8B Instant API Pricing & Cost Calculator

Groq-hosted Llama 3.1 8B Instant tier for very low-cost, fast workloads.

Pricing Details

Official source: groq.com

Small Request$0.0001

Medium Request$0.0013

Large Request$0.009

High Volume$0.09

Scenario	Input Tokens	Output Tokens	Input Cost	Output Cost	Total Cost
Small Request	1,000	1,000	$0.0001	$0.0001	$0.0001
Medium Request	10,000	10,000	$0.0005	$0.0008	$0.0013
Large Request	100,000	50,000	$0.005	$0.004	$0.009
High Volume	1,000,000	500,000	$0.05	$0.04	$0.09

Keep Evaluating

Use a workload-first framework instead of picking the most famous model by default.

Estimate token budgets before you commit a feature or pricing plan.

Reduce spend with routing, caching, batching, and model mix strategies.