GLM-4.7-FlashX
Z.AI first-party · 200,000-token context · from $0.07 input / $0.40 output per 1M tokens
- Input price ranks #81 of 447 priced models (cheaper = better).
- Cheaper alternative with an equal or larger context window: Qwen3.5-Flash at $0.065 input per 1M tokens.
| Field | Value |
|---|---|
| Provider | Z.AI |
| Family | glm-flash |
| Context window | 200,000 tokens |
| Max output | 131,072 tokens |
| Input price | $0.07 per 1M tokens |
| Output price | $0.40 per 1M tokens |
| Cache read | $0.01 per 1M tokens |
| Cache write | $0 per 1M tokens |
| Reasoning | Yes |
| Tool calling | Yes |
| Vision | No |
| Open weights | Yes |
| Knowledge cutoff | 2025-04 |
| Released | 2026-01-19 |
| Last updated | 2026-01-19 |
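
As a rough illustration of how the per-1M-token prices listed above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and its parameters are hypothetical and not part of any Z.AI SDK. It assumes that cached input tokens are billed at the cache-read rate instead of the input rate, and that reasoning tokens are counted as output tokens. The listing confirms neither assumption, so check the provider's billing rules before relying on the result.

```python
# Prices in USD per 1M tokens, copied from the listing above.
INPUT_PER_M = 0.07
OUTPUT_PER_M = 0.40
CACHE_READ_PER_M = 0.01


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: `cached_tokens` is the part of `input_tokens` served from
    the cache and billed at the cache-read rate instead of the input rate.
    Assumption: reasoning tokens are already included in `output_tokens`.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHE_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


# Example: a 100,000-token prompt with a 2,000-token reply, no cache hits.
print(f"${estimate_cost(100_000, 2_000):.4f}")  # prints $0.0078
```

Under the same assumptions, serving the whole 100,000-token prompt from the cache would lower the estimate to $0.0018, because the cache-read rate is one seventh of the input rate.
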
Available from 2 curated sources
| Source | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| Z.AI first-party | $0.07 | $0.40 |
| Zhipu AI first-party | $0.07 | $0.40 |
Identity: Z.AI; data via the models.dev and OpenRouter public APIs. Always confirm current prices on the provider's official pricing page.