Z.ai models
Call them through one OpenAI-compatible API, billed at base per-token rates.
Last updated June 5, 2026
Prices are per 1M tokens and are billed from your credit balance, with no markup on usage. Prices load live from our catalogue.
| Model | Provider | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|
| GLM 5 Turbo | Z.ai | 203K | $1.20 | $4.00 |
| GLM 5.1 | Z.ai | 203K | $0.98 | $3.08 |
| GLM 5 | Z.ai | 203K | $0.60 | $1.92 |
| GLM 4.6 | Z.ai | 203K | $0.43 | $1.74 |
| GLM 4.7 | Z.ai | 203K | $0.40 | $1.75 |
| GLM 4.7 Flash | Z.ai | 203K | $0.06 | $0.40 |
Keep the OpenAI SDK; set base_url to Zylo and pick a Z.ai model id.
from openai import OpenAI

client = OpenAI(api_key="ZYLO_KEY", base_url="https://api.zyloai.net/v1")
response = client.chat.completions.create(
    model="glm-5-turbo",
    messages=[{"role": "user", "content": "Hello from Zylo!"}],
)
print(response.choices[0].message.content)
Zylo currently offers 6 Z.ai text models, including GLM 5 Turbo, GLM 5.1, GLM 5, and GLM 4.6. See the table above for live per-1M-token pricing.
Each Z.ai model is billed at its base per-token rate, deducted from prepaid credits — no markup on usage. Zylo's 25% platform fee applies only when you add credits.
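To make the per-token billing concrete, here is a minimal sketch that estimates the usage cost of a single request from the rates in the table above. The `RATES_PER_1M` dictionary and the `usage_cost` helper are hypothetical names introduced for illustration, and the rates are a snapshot that may drift from the live catalogue. The sketch covers usage only; it does not model the platform fee charged when you add credits.

```python
# Snapshot of the pricing table above, in USD per 1M tokens: (input, output).
# Keyed by display name; these values are copied by hand and may be out of date.
RATES_PER_1M = {
    "GLM 5 Turbo": (1.20, 4.00),
    "GLM 5.1": (0.98, 3.08),
    "GLM 5": (0.60, 1.92),
    "GLM 4.6": (0.43, 1.74),
    "GLM 4.7": (0.40, 1.75),
    "GLM 4.7 Flash": (0.06, 0.40),
}


def usage_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated usage cost in USD for one request (hypothetical helper)."""
    input_rate, output_rate = RATES_PER_1M[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# 200K input tokens and 50K output tokens on GLM 5 Turbo:
# 0.2 * $1.20 + 0.05 * $4.00 = $0.24 + $0.20
cost = usage_cost("GLM 5 Turbo", input_tokens=200_000, output_tokens=50_000)
print(f"${cost:.4f}")  # prints $0.4400
```

The same request on GLM 4.7 Flash would cost $0.032 under these rates, so the choice of model dominates the bill far more than small changes in prompt length.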
Yes, the API is OpenAI-compatible. Point any OpenAI SDK at https://api.zyloai.net/v1, use your Zylo key, and set the model id (e.g. glm-5-turbo).
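For readers not using an SDK, the sketch below shows roughly what that call looks like as a plain HTTP request, built with Python's standard library. It assumes Zylo accepts the same `/chat/completions` path and `Authorization: Bearer` header that the OpenAI SDK sends, which is what OpenAI compatibility implies but is not spelled out on this page. `build_chat_request` is a hypothetical helper, and `ZYLO_KEY` is a placeholder for your own key.

```python
import json
import urllib.request

BASE_URL = "https://api.zyloai.net/v1"


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request.

    Hypothetical helper: the path and headers follow the OpenAI Chat
    Completions convention and are assumed to apply to Zylo as well.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_request("ZYLO_KEY", "glm-5-turbo", "Hello from Zylo!")
print(request.get_method(), request.full_url)
# prints: POST https://api.zyloai.net/v1/chat/completions

# Sending it requires a valid key and network access:
# with urllib.request.urlopen(request) as response:
#     body = json.load(response)
#     print(body["choices"][0]["message"]["content"])
```

Building the request separately from sending it keeps the example runnable offline and makes it easy to inspect exactly what would go over the wire.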
Free API key, OpenAI-compatible, local payments.