Meta

Llama 4 Maverick

Call Llama 4 Maverick for general-purpose chat, reasoning, and tool use through one OpenAI-compatible endpoint, with local payments and a free API key to get started.

Model id: llama-4
Context: 1M tokens
Plan: Basic (free) & up
Input: $0.15 / 1M tokens
Output: $0.60 / 1M tokens

Last updated June 5, 2026

Pricing

Per 1M tokens, billed from your credit balance — there is no markup on usage.

Direction    Price / 1M tokens
Input        $0.15
Output       $0.60

How billing works. On a paid plan, usage is charged against your prepaid credits at the rates above, with no per-token markup. Zylo's flat 25% platform fee applies only when you add credits. The free Basic plan works differently: it includes a daily allowance for Basic-tier models, capped at 10 requests/min, and requires no card and no credits. Llama 4 Maverick is a Basic-tier model, so it is callable on the free Basic plan within that daily allowance and the global 10 requests/min limit. Prices update live from our catalogue.
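
To make the arithmetic concrete, here is a minimal sketch that estimates the usage cost of a request from its token counts, using the rates listed above. The function name `estimate_usage_cost` and the constants are illustrative and not part of any Zylo SDK. The rates are hard-coded even though prices update live, and the 25% platform fee is left out on purpose because it applies when adding credits, not per request.

```python
from decimal import Decimal

# Rates copied from the pricing table above (USD per 1M tokens).
# Hard-coded for illustration only; live catalogue prices may differ.
INPUT_USD_PER_MILLION = Decimal("0.15")
OUTPUT_USD_PER_MILLION = Decimal("0.60")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_usage_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated usage cost in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = INPUT_USD_PER_MILLION * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_USD_PER_MILLION * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


# 200,000 input tokens and 50,000 output tokens:
# 0.2 * $0.15 + 0.05 * $0.60 = $0.03 + $0.03 = $0.06
cost = estimate_usage_cost(200_000, 50_000)
```

If the response follows the OpenAI schema, the token counts would typically come from `response.usage.prompt_tokens` and `response.usage.completion_tokens`; this page does not state whether Zylo populates that field, so treat it as an assumption.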

Quickstart

Already using the OpenAI SDK? Change two lines — base_url and your key — and set the model to llama-4.

Terminal
curl https://api.zyloai.net/v1/chat/completions \
  -H "Authorization: Bearer YOUR_ZYLO_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-4",
    "messages": [{"role": "user", "content": "Hello from Zylo!"}]
  }'
Python
# pip install openai
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_ZYLO_KEY",
    base_url="https://api.zyloai.net/v1",
)

response = client.chat.completions.create(
    model="llama-4",
    messages=[{"role": "user", "content": "Hello from Zylo!"}],
)
print(response.choices[0].message.content)
Node.js
// npm install openai
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "YOUR_ZYLO_KEY",
  baseURL: "https://api.zyloai.net/v1",
});

const response = await client.chat.completions.create({
  model: "llama-4",
  messages: [{ role: "user", content: "Hello from Zylo!" }],
});
console.log(response.choices[0].message.content);
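
Without the SDK
For environments where installing the OpenAI SDK is not an option, the curl call above can be mirrored with the Python standard library. This is a sketch, not an official client: `build_chat_request` and `send_chat_request` are hypothetical helper names, and the response parsing assumes the same `choices[0].message.content` shape that the SDK examples read.

```python
import json
import urllib.request

ZYLO_CHAT_URL = "https://api.zyloai.net/v1/chat/completions"


def build_chat_request(api_key: str, prompt: str,
                       model: str = "llama-4") -> urllib.request.Request:
    """Build the same POST request as the curl example (hypothetical helper)."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        ZYLO_CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(request: urllib.request.Request,
                      timeout: float = 30.0) -> str:
    """Send the request and return the first choice's text.

    Assumes an OpenAI-style response body; raises urllib.error.HTTPError
    on non-2xx status codes.
    """
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        payload = json.load(resp)
    return payload["choices"][0]["message"]["content"]


# Building the request performs no network I/O.
request = build_chat_request("YOUR_ZYLO_KEY", "Hello from Zylo!")
```

Calling `send_chat_request(request)` with a real key performs the actual HTTP call.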

Migrating?

On OpenRouter this model is meta-llama/llama-4-maverick; on Zylo, use the id llama-4.
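
If your codebase already stores OpenRouter-style ids, a small translation layer can keep the migration to a single place. The sketch below contains only the one mapping stated on this page; the helper name `to_zylo_model_id` is hypothetical, and ids for other models would need to be looked up in Zylo's catalogue rather than guessed.

```python
ZYLO_BASE_URL = "https://api.zyloai.net/v1"

# Only the mapping given on this page; extend from the catalogue as needed.
OPENROUTER_TO_ZYLO = {
    "meta-llama/llama-4-maverick": "llama-4",
}


def to_zylo_model_id(model_id: str) -> str:
    """Translate an OpenRouter-style id to a Zylo id (hypothetical helper)."""
    if model_id in OPENROUTER_TO_ZYLO:
        return OPENROUTER_TO_ZYLO[model_id]
    if "/" in model_id:
        # Vendor-prefixed id with no known mapping: fail loudly instead of
        # sending an id the endpoint may not recognise.
        raise ValueError(f"no Zylo mapping known for {model_id!r}")
    # Bare ids are passed through unchanged.
    return model_id


zylo_id = to_zylo_model_id("meta-llama/llama-4-maverick")
```

Failing on unknown vendor-prefixed ids is a deliberate choice here: it surfaces a missing mapping at the call site instead of as an API error later.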

Frequently asked questions

How much does Llama 4 Maverick cost on Zylo?

Llama 4 Maverick is billed at its base per-token rate: $0.15 per 1M input tokens and $0.60 per 1M output tokens, deducted from your prepaid credits. There is no markup on usage — Zylo's 25% platform fee applies only when you add credits.

Is Llama 4 Maverick available on the free Basic plan?

Yes. Llama 4 Maverick is a Basic-tier model, so you can call it on the free Basic plan within its daily usage allowance and a global limit of 10 requests per minute. The Basic plan does not include credits; paid plans (Go and up) add premium models and credits you spend on usage.
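
To stay under the 10 requests per minute limit, one option is to throttle on the client side. The following is a minimal sliding-window sketch, assuming the limit is counted over any rolling 60-second span; how Zylo actually measures the window is not documented here, and the class name `SlidingWindowLimiter` is illustrative.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` calls in any rolling `period` seconds."""

    def __init__(self, max_calls=10, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock    # injectable so the limiter can be tested
        self._sleep = sleep
        self._calls = deque()  # timestamps of recent calls, oldest first

    def _drop_expired(self, now):
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()

    def wait_time(self):
        """Seconds to wait before the next call is allowed (0.0 if none)."""
        now = self._clock()
        self._drop_expired(now)
        if len(self._calls) < self.max_calls:
            return 0.0
        return self.period - (now - self._calls[0])

    def acquire(self):
        """Block until a call is allowed, then record it."""
        delay = self.wait_time()
        if delay > 0:
            self._sleep(delay)
        now = self._clock()
        self._drop_expired(now)
        self._calls.append(now)


limiter = SlidingWindowLimiter(max_calls=10, period=60.0)
```

Call `limiter.acquire()` immediately before each `client.chat.completions.create(...)` request. This only paces a single process; it does not coordinate across several workers sharing one key.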

What is the context window of Llama 4 Maverick?

Llama 4 Maverick supports up to 1M tokens of context through Zylo's OpenAI-compatible API.

Is Llama 4 Maverick OpenAI-compatible?

Yes. Point any OpenAI SDK at https://api.zyloai.net/v1, use your Zylo API key, and set the model to llama-4.

How do I switch Llama 4 Maverick from OpenRouter to Zylo?

On OpenRouter this model is meta-llama/llama-4-maverick. On Zylo, use the bare id llama-4 with base URL https://api.zyloai.net/v1 — no vendor prefix.

Start calling Llama 4 Maverick in under 2 minutes

Create a free account and get an API key — no credit card required.
