Model Details
2025-08Price Data Last Verified: April 22, 2026
GPT OSS 120B API Pricing & Cost Calculator
Groq-hosted open model option with strong price-to-latency characteristics for production traffic.
Pricing Details
- Provider
- Groq
- Input Price
- $0.15 / 1M tokens
- Output Price
- $0.60 / 1M tokens
- Context Window
- 131K
Cost Scenarios
Small Request$0.0007
Medium Request$0.0075
Large Request$0.045
High Volume$0.45
Detailed Cost Breakdown
| Scenario | Input Tokens | Output Tokens | Input Cost | Output Cost | Total Cost |
|---|---|---|---|---|---|
| Small Request | 1,000 | 1,000 | $0.0002 | $0.0006 | $0.0007 |
| Medium Request | 10,000 | 10,000 | $0.0015 | $0.006 | $0.0075 |
| Large Request | 100,000 | 50,000 | $0.015 | $0.03 | $0.045 |
| High Volume | 1,000,000 | 500,000 | $0.15 | $0.30 | $0.45 |
Other Groq Models
Keep Evaluating
What to read before you lock in GPT OSS 120B
GPT OSS 120B vs DeepSeek V3.2
See the decision-ready pricing breakdown between GPT OSS 120B and DeepSeek V3.2.
Choose the Right AI Model
Use a workload-first framework instead of picking the most famous model by default.
Token Calculation and Cost Estimation
Estimate token budgets before you commit a feature or pricing plan.
AI API Cost Optimization Guide
Reduce spend with routing, caching, batching, and model mix strategies.