Model Details
2025-04Price Data Last Verified: June 23, 2026
GLM-4 32B 128K API Pricing & Cost Calculator
Lower-cost GLM 4 32B long-context model listed on Z.AI pricing.
Pricing Details
- Provider
- Zhipu AI
- Input Price
- $0.10 / 1M tokens
- Output Price
- $0.10 / 1M tokens
- Context Window
- 128K
Official source: docs.z.ai
Cost Scenarios
Small Request$0.0002
Medium Request$0.002
Large Request$0.015
High Volume$0.15
Detailed Cost Breakdown
| Scenario | Input Tokens | Output Tokens | Input Cost | Output Cost | Total Cost |
|---|---|---|---|---|---|
| Small Request | 1,000 | 1,000 | $0.0001 | $0.0001 | $0.0002 |
| Medium Request | 10,000 | 10,000 | $0.001 | $0.001 | $0.002 |
| Large Request | 100,000 | 50,000 | $0.01 | $0.005 | $0.015 |
| High Volume | 1,000,000 | 500,000 | $0.10 | $0.05 | $0.15 |
Other Zhipu AI Models
Keep Evaluating
What to read before you lock in GLM-4 32B 128K
Choose the Right AI Model
Use a workload-first framework instead of picking the most famous model by default.
Token Calculation and Cost Estimation
Estimate token budgets before you commit a feature or pricing plan.
AI API Cost Optimization Guide
Reduce spend with routing, caching, batching, and model mix strategies.