Meta

Llama 4 Scout

Call Llama 4 Scout for general-purpose chat, reasoning, and tool use — through one OpenAI-compatible endpoint, with local payments, on the Go plan and up.

Model id: llama-4-scout
Context: 10M tokens
Plan: Go plan & up
Input: $0.08 / 1M tokens
Output: $0.30 / 1M tokens

Last updated June 5, 2026

Pricing

Per 1M tokens, billed from your credit balance — there is no markup on usage.

Direction   Price / 1M tokens
Input       $0.08
Output      $0.30

How billing works. On a paid plan, usage is deducted from your prepaid credits at the rates above, with no per-token markup. Zylo's flat 25% platform fee applies only when you add credits. The free Basic plan works differently: it gives a daily allowance of Basic-tier models (10 requests/min, no card, no credits) and does not include Llama 4 Scout, which requires the Go plan or higher. Prices update live from our catalogue.
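
To make the per-token arithmetic concrete, here is a minimal sketch that estimates the credit deduction for a single request. The rates are copied from the table above; the helper name `usage_cost_usd` and the example token counts are illustrative assumptions, not part of Zylo's API. The 25% platform fee is deliberately left out, because this page does not say how it is calculated when credits are added.

```python
from decimal import Decimal

# Rates copied from the pricing table on this page (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("0.08")
OUTPUT_USD_PER_M = Decimal("0.30")
TOKENS_PER_M = Decimal(1_000_000)


def usage_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the usage cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


# Example with made-up counts: 200,000 input tokens and 10,000 output tokens.
print(usage_cost_usd(200_000, 10_000))
```

Decimal is used instead of float so that very small per-request amounts do not pick up binary rounding noise. Since prices update live from the catalogue, treat the hard-coded rates as a snapshot.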

Quickstart

Already using the OpenAI SDK? Change two lines — base_url and your key — and set the model to llama-4-scout.

Terminal
curl https://api.zyloai.net/v1/chat/completions \
  -H "Authorization: Bearer YOUR_ZYLO_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-4-scout",
    "messages": [{"role": "user", "content": "Hello from Zylo!"}]
  }'
Python
# pip install openai
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_ZYLO_KEY",
    base_url="https://api.zyloai.net/v1",
)

response = client.chat.completions.create(
    model="llama-4-scout",
    messages=[{"role": "user", "content": "Hello from Zylo!"}],
)
print(response.choices[0].message.content)
Node.js
// npm install openai
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "YOUR_ZYLO_KEY",
  baseURL: "https://api.zyloai.net/v1",
});

const response = await client.chat.completions.create({
  model: "llama-4-scout",
  messages: [{ role: "user", content: "Hello from Zylo!" }],
});
console.log(response.choices[0].message.content);

Migrating?

On OpenRouter this model is meta-llama/llama-4-scout; on Zylo, use the id llama-4-scout.
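
If your code stores OpenRouter-style ids, the rename can be sketched as a small helper that drops the vendor prefix. The function name `to_zylo_model_id` is hypothetical, and this page only confirms the mapping for Llama 4 Scout; whether the same rule holds for other models is an assumption you should check against the catalogue.

```python
def to_zylo_model_id(openrouter_id: str) -> str:
    """Drop a leading 'vendor/' prefix from a model id (hypothetical helper).

    Only the meta-llama/llama-4-scout -> llama-4-scout mapping is confirmed
    by this page; applying it to other models is an assumption.
    """
    # split("/", 1) cuts at the first slash; ids without a slash pass through.
    return openrouter_id.split("/", 1)[-1]


print(to_zylo_model_id("meta-llama/llama-4-scout"))  # llama-4-scout
```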

Frequently asked questions

How much does Llama 4 Scout cost on Zylo?

Llama 4 Scout is billed at its base per-token rate: $0.08 per 1M input tokens and $0.30 per 1M output tokens, deducted from your prepaid credits. There is no markup on usage — Zylo's 25% platform fee applies only when you add credits.

Which plan do I need to use Llama 4 Scout?

Llama 4 Scout requires the Go plan or higher. The free Basic plan includes only Basic-tier models. Paid plans (Go and up) add premium models such as Llama 4 Scout and include credits, which you spend on usage at the per-token rates listed under Pricing.

What is the context window of Llama 4 Scout?

Llama 4 Scout supports up to 10M tokens of context through Zylo's OpenAI-compatible API.

Is Llama 4 Scout OpenAI-compatible?

Yes. Point any OpenAI SDK at https://api.zyloai.net/v1, use your Zylo API key, and set the model to llama-4-scout.
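
Because the endpoint follows the OpenAI wire format, it can also be called without an SDK. The sketch below builds the same request as the curl example using only the Python standard library. The helper names `build_chat_request` and `send` are illustrative, and the response shape read in `send` (`choices[0].message.content`) is assumed from the SDK examples on this page rather than from a published Zylo schema.

```python
import json
import urllib.request

ZYLO_BASE_URL = "https://api.zyloai.net/v1"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request (hypothetical helper)."""
    body = {
        "model": "llama-4-scout",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{ZYLO_BASE_URL}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the reply text (assumed response shape)."""
    with urllib.request.urlopen(request, timeout=30) as resp:
        payload = json.load(resp)
    return payload["choices"][0]["message"]["content"]


# Building the request makes no network call; pass it to send() to execute it.
request = build_chat_request("YOUR_ZYLO_KEY", "Hello from Zylo!")
print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test before any credits are spent.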

How do I switch Llama 4 Scout from OpenRouter to Zylo?

On OpenRouter this model is meta-llama/llama-4-scout. On Zylo, use the bare id llama-4-scout with base URL https://api.zyloai.net/v1 — no vendor prefix.

Start building with Llama 4 Scout

Llama 4 Scout runs on the Go plan or higher — create an account and upgrade to the Go plan to call it.
