32 Models
Price Data Last Verified: April 22, 2026
All AI Models Pricing Directory
Browse our complete database of AI model pricing. Click any model to see detailed cost breakdowns, context window information, and usage scenarios. Data is updated weekly from official provider documentation.
Quick Reference Table
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context (tokens) |
|---|---|---|---|---|
| GPT-5.4 | OpenAI | $2.50 | $15.00 | 1M |
| GPT-5.4 mini | OpenAI | $0.75 | $4.50 | 1M |
| GPT-5.4 nano | OpenAI | $0.20 | $1.25 | 1M |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1M |
| GPT-4.1 mini | OpenAI | $0.40 | $1.60 | 1M |
| GPT-4.1 nano | OpenAI | $0.10 | $0.40 | 1M |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K |
| GPT-4o mini | OpenAI | $0.15 | $0.60 | 128K |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 | 200K |
| Claude Opus 4 | Anthropic | $15.00 | $75.00 | 200K |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 3.7 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 3.5 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Haiku 3.5 | Anthropic | $0.80 | $4.00 | 200K |
| Claude Haiku 3 | Anthropic | $0.25 | $1.25 | 200K |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | 1M |
| Gemini 2.5 Flash | Google | $0.15 | $0.60 | 1M |
| Gemini 2.5 Flash-Lite | Google | $0.10 | $0.40 | 1M |
| DeepSeek V3.2 | DeepSeek | $0.28 | $0.42 | 128K |
| DeepSeek Reasoner | DeepSeek | $0.28 | $0.42 | 128K |
| Kimi K2.5 | Moonshot AI | $0.5797 | $3.0435 | 128K |
| Kimi K2 0905 | Moonshot AI | $0.5797 | $2.3188 | 128K |
| MiniMax M2.7 | MiniMax | $0.30 | $1.20 | 204.8K |
| MiniMax M2.7 Highspeed | MiniMax | $0.60 | $2.40 | 204.8K |
| Mistral Small 4 | Mistral | $0.15 | $0.60 | 128K |
| Mistral Medium 3.1 | Mistral | $0.40 | $2.00 | 128K |
| Mistral Small 3.2 | Mistral | $0.10 | $0.30 | 128K |
| Grok 4.20 | xAI | $2.00 | $6.00 | 256K |
| Grok 4.1 Fast | xAI | $0.20 | $0.50 | 256K |
| GPT OSS 120B | Groq | $0.15 | $0.60 | 131K |
| Qwen QwQ 32B | Groq | $0.29 | $0.59 | 128K |
| Llama 3.3 70B Versatile | Groq | $0.59 | $0.79 | 128K |
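Because every price in the table is quoted per 1M tokens, a workload estimate is a weighted sum of input and output tokens. The sketch below is a minimal illustration, not part of the directory itself: the `Price` class, the `request_cost` and `monthly_cost` helpers, and the sample workload (2,000 input tokens, 500 output tokens, 100,000 requests per month) are all hypothetical, and the calculation assumes flat list prices with no caching, batch, or long-context adjustments.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """USD list price per 1M tokens, matching the table columns."""
    input_per_million: float
    output_per_million: float


# A few rows copied from the Quick Reference Table.
PRICES = {
    "GPT-5.4 mini": Price(0.75, 4.50),
    "Claude Sonnet 4": Price(3.00, 15.00),
    "Gemini 2.5 Flash": Price(0.15, 0.60),
    "DeepSeek V3.2": Price(0.28, 0.42),
}

TOKENS_PER_MILLION = 1_000_000


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for the given token counts at flat list prices."""
    price = PRICES[model]
    total = (input_tokens * price.input_per_million
             + output_tokens * price.output_per_million)
    return total / TOKENS_PER_MILLION


def monthly_cost(model: str, input_tokens: int, output_tokens: int,
                 requests: int) -> float:
    """Cost in USD for `requests` identical requests (hypothetical workload)."""
    return request_cost(model, input_tokens * requests, output_tokens * requests)


if __name__ == "__main__":
    # Assumed workload: 2,000 input + 500 output tokens, 100,000 requests/month.
    for model in PRICES:
        cost = monthly_cost(model, 2_000, 500, 100_000)
        print(f"{model}: ${cost:,.2f} per month")
```

Under that assumed workload the estimate comes to roughly $375 per month for GPT-5.4 mini and $1,350 for Claude Sonnet 4, which shows how much the output price dominates once responses are more than a few hundred tokens long.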
Models by Provider
Anthropic (7 models)
Claude Opus 4.1 (Popular, 2025)
200K context
Best for complex reasoning
In: $15.00, Out: $75.00
Claude Opus 4 (2025)
200K context
In: $15.00, Out: $75.00
Claude Sonnet 4 (Popular, 2025)
200K context
Best for coding & development
In: $3.00, Out: $15.00
Claude Sonnet 3.7 (2025)
200K context
Recommended for production
In: $3.00, Out: $15.00
Claude Sonnet 3.5 (2024)
200K context
In: $3.00, Out: $15.00
Claude Haiku 3.5 (Popular, 2024)
200K context
Best for fast responses
In: $0.80, Out: $4.00
Claude Haiku 3 (2024)
200K context
Best value for money
In: $0.25, Out: $1.25
DeepSeek (2 models)
Google (3 models)
Groq (3 models)
MiniMax (2 models)
Mistral (3 models)
Moonshot AI (2 models)
OpenAI (8 models)
GPT-5.4 (Popular, 2026)
1M context
Best for complex reasoning
In: $2.50, Out: $15.00
GPT-5.4 mini (Popular, 2026)
1M context
Recommended for production
In: $0.75, Out: $4.50
GPT-5.4 nano (2026)
1M context
Best for fast responses
In: $0.20, Out: $1.25
GPT-4.1 (Popular, 2025)
1M context
Best for coding & development
In: $2.00, Out: $8.00
GPT-4.1 mini (Popular, 2025)
1M context
Best value for money
In: $0.40, Out: $1.60
GPT-4.1 nano (2025)
1M context
In: $0.10, Out: $0.40
GPT-4o (Popular, 2024)
128K context
Recommended for production
In: $2.50, Out: $10.00
GPT-4o mini (Popular, 2024)
128K context
Best for fast responses
In: $0.15, Out: $0.60
Data Sources & Methodology
Verification Status
Last Verified: April 22, 2026
Update Frequency: Weekly
Data Source: Official provider pricing pages
Normalization
Moonshot AI publishes its prices in CNY. The USD figures shown here are converted at a snapshot rate of 6.90 CNY per USD taken on April 22, 2026.
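The conversion itself is a single division followed by rounding to four decimal places, which is the precision the Moonshot AI rows use in the table. The sketch below is illustrative only: the `cny_to_usd` helper is hypothetical, and the CNY inputs (4.00, 21.00, 16.00) are back-calculated from the USD figures in the table rather than quoted from Moonshot AI's pricing page, so treat them as assumptions. Half-up rounding is also an assumption.

```python
from decimal import Decimal, ROUND_HALF_UP

# Snapshot rate stated in the Normalization note (CNY per 1 USD).
CNY_PER_USD = Decimal("6.90")


def cny_to_usd(price_cny: str, places: str = "0.0001") -> Decimal:
    """Convert a CNY price to USD, rounded half-up to four decimals."""
    usd = Decimal(price_cny) / CNY_PER_USD
    return usd.quantize(Decimal(places), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # Assumed CNY list prices per 1M tokens (inferred, not from the source).
    assumed_cny = {
        "Kimi K2.5 input": "4.00",
        "Kimi K2.5 output": "21.00",
        "Kimi K2 0905 output": "16.00",
    }
    for label, cny in assumed_cny.items():
        print(f"{label}: {cny} CNY -> ${cny_to_usd(cny)}")
```

With those assumed inputs the helper reproduces the table's $0.5797, $3.0435, and $2.3188. Using `Decimal` rather than floats keeps the rounding deterministic, which matters when the converted figures are republished each week against a new rate.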
Official Pricing Pages
Disclaimer: Prices are subject to change. Always verify current pricing on official provider websites before making business decisions.