NVIDIA

Nemotron 3 Nano 30B A3B

Call Nemotron 3 Nano 30B A3B for ultra-low-cost, high-volume workloads — through one OpenAI-compatible endpoint, with local payments, on the Go plan and up.

Model id: nemotron-3-nano-30b-a3b
Context: 262K tokens
Plan: Go plan and up
Input: $0.05 / 1M tokens
Output: $0.20 / 1M tokens

Last updated June 5, 2026

Pricing

Per 1M tokens, billed from your credit balance — there is no markup on usage.

Direction    Price / 1M tokens
Input        $0.05
Output       $0.20

How billing works. On a paid plan, usage is deducted from your prepaid credits at the rates above, with no per-token markup. Zylo's flat 25% platform fee applies only when you add credits. The free Basic plan works differently: it gives a daily allowance of Basic-tier models (10 requests/min, no card, no credits) and does not include Nemotron 3 Nano 30B A3B, which requires the Go plan or higher. Prices update live from our catalogue.
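
As a rough illustration of the per-token arithmetic, the sketch below estimates what a request would draw from your credits at the rates listed on this page. The function name and the example workload are hypothetical, and the 25% platform fee is left out on purpose, because it applies when credits are added rather than when they are spent.

```python
from decimal import Decimal

# Rates quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.05")
OUTPUT_USD_PER_M = Decimal("0.20")
TOKENS_PER_M = Decimal(1_000_000)


def usage_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the credit cost of one request at the rates above.

    Hypothetical helper for illustration; it does not model the
    platform fee charged when credits are purchased.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


# Hypothetical workload: 2,000 input tokens and 500 output tokens per request.
per_request = usage_cost_usd(2_000, 500)
print(f"per request: ${per_request}")
print(f"10,000 requests: ${per_request * 10_000}")
```

Under these assumptions a single request costs $0.0002, so 10,000 such requests would use about $2.00 of credits.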

Quickstart

Already using the OpenAI SDK? Change two lines — base_url and your key — and set the model to nemotron-3-nano-30b-a3b.

Terminal
curl https://api.zyloai.net/v1/chat/completions \
  -H "Authorization: Bearer YOUR_ZYLO_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nemotron-3-nano-30b-a3b",
    "messages": [{"role": "user", "content": "Hello from Zylo!"}]
  }'

Python
# pip install openai
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_ZYLO_KEY",
    base_url="https://api.zyloai.net/v1",
)

response = client.chat.completions.create(
    model="nemotron-3-nano-30b-a3b",
    messages=[{"role": "user", "content": "Hello from Zylo!"}],
)
print(response.choices[0].message.content)

Node.js
// npm install openai
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "YOUR_ZYLO_KEY",
  baseURL: "https://api.zyloai.net/v1",
});

const response = await client.chat.completions.create({
  model: "nemotron-3-nano-30b-a3b",
  messages: [{ role: "user", content: "Hello from Zylo!" }],
});
console.log(response.choices[0].message.content);

Migrating?

On OpenRouter this model is nvidia/nemotron-3-nano-30b-a3b; on Zylo, use the id nemotron-3-nano-30b-a3b.
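
If your code stores OpenRouter-style ids, the rename can be sketched as a small helper. The function name is hypothetical, and the rule it applies (the Zylo id is whatever follows the first slash) is only confirmed on this page for this one model; treat it as an assumption for any other id.

```python
ZYLO_BASE_URL = "https://api.zyloai.net/v1"


def to_zylo_model_id(openrouter_id: str) -> str:
    """Drop a leading 'vendor/' prefix from an OpenRouter-style model id.

    Assumption: the Zylo id is the part after the first '/'. This page
    confirms that only for nvidia/nemotron-3-nano-30b-a3b.
    """
    _vendor, sep, bare = openrouter_id.partition("/")
    # If there was no '/', the id is already bare and is returned unchanged.
    return bare if sep else openrouter_id


print(to_zylo_model_id("nvidia/nemotron-3-nano-30b-a3b"))
```

Alongside the id change, point your client at the base URL held in `ZYLO_BASE_URL` and swap in your Zylo key.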

Frequently asked questions

How much does Nemotron 3 Nano 30B A3B cost on Zylo?

Nemotron 3 Nano 30B A3B is billed at its base per-token rate: $0.05 per 1M input tokens and $0.20 per 1M output tokens, deducted from your prepaid credits. There is no markup on usage — Zylo's 25% platform fee applies only when you add credits.

Which plan do I need to use Nemotron 3 Nano 30B A3B?

Nemotron 3 Nano 30B A3B requires the Go plan or higher. The free Basic plan only includes Basic-tier models; paid plans (Go and up) add premium models like Nemotron 3 Nano 30B A3B and include credits you spend on usage at the rate above.

What is the context window of Nemotron 3 Nano 30B A3B?

Nemotron 3 Nano 30B A3B supports up to 262K tokens of context through Zylo's OpenAI-compatible API.
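
A simple pre-flight check can help avoid sending requests that would overflow the window. The sketch below rests on two assumptions that this page does not state: that "262K" can be read conservatively as 262,000 tokens, and that the prompt and the generated output share the same window. Token counts are passed in by the caller, since tokenization is not covered here.

```python
# Assumption: "262K" is read conservatively as 262,000 tokens.
# The exact limit is not stated on this page.
CONTEXT_LIMIT_TOKENS = 262_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 limit: int = CONTEXT_LIMIT_TOKENS) -> bool:
    """Return True if prompt plus requested output fits within the limit.

    Assumes input and output tokens count against one shared window.
    """
    return prompt_tokens + max_output_tokens <= limit


def max_output_budget(prompt_tokens: int,
                      limit: int = CONTEXT_LIMIT_TOKENS) -> int:
    """Return how many output tokens remain after the prompt (never negative)."""
    return max(0, limit - prompt_tokens)


print(fits_context(250_000, 10_000))
print(max_output_budget(250_000))
```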

Is Nemotron 3 Nano 30B A3B OpenAI-compatible?

Yes. Point any OpenAI SDK at https://api.zyloai.net/v1, use your Zylo API key, and set the model to nemotron-3-nano-30b-a3b.
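
To show what compatibility means on the wire, here is a minimal sketch that builds the same request as the curl example using only the Python standard library. The helper names `build_chat_request` and `send` are hypothetical, and the response shape read in `send` is assumed to match what the SDK examples on this page access (`choices[0].message.content`).

```python
import json
import urllib.request

ZYLO_CHAT_URL = "https://api.zyloai.net/v1/chat/completions"
MODEL_ID = "nemotron-3-nano-30b-a3b"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request."""
    body = json.dumps({
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        ZYLO_CHAT_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the first choice's text.

    Assumes the response follows the OpenAI chat completion shape.
    """
    with urllib.request.urlopen(request, timeout=30) as resp:
        payload = json.load(resp)
    return payload["choices"][0]["message"]["content"]
```

Calling `send(build_chat_request("YOUR_ZYLO_KEY", "Hello from Zylo!"))` would perform the network call. Building and sending are kept separate so the request can be inspected or logged first.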

How do I switch Nemotron 3 Nano 30B A3B from OpenRouter to Zylo?

On OpenRouter this model is nvidia/nemotron-3-nano-30b-a3b. On Zylo, use the bare id nemotron-3-nano-30b-a3b with base URL https://api.zyloai.net/v1 — no vendor prefix.

Start building with Nemotron 3 Nano 30B A3B

Nemotron 3 Nano 30B A3B runs on the Go plan or higher — create an account and upgrade to the Go plan to call it.