GLM-4.7-FlashX

Z.AI first-party · 200,000 ctx · from $0.07 input / $0.4 output per 1M tokens

Input price ranks #81 of 447 priced models (cheaper = better).
Cheaper alternative with ≥ same context: Qwen3.5-Flash at $0.065 input.

Provider	Z.AI
Family	glm-flash
Context window	200,000
Max output	131,072
Input price	$0.07
Output price	$0.4
Cache read	$0.01
Cache write	$0
Reasoning	Yes
Tool calling	Yes
Vision	No
Open weights	Yes
Knowledge cutoff	2025-04
Released	2026-01-19
Last updated	2026-01-19

Available from 2 curated sources

Source	Input	Output
Z.AI first-party	$0.07	$0.4
Zhipu AI first-party	$0.07	$0.4

Identity: Z.AI; data via models.dev + OpenRouter public APIs. Always confirm on the provider's official pricing page.