NVIDIA

Nemotron 3 Super

Call Nemotron 3 Super for general-purpose chat, reasoning, and tool use — through one OpenAI-compatible endpoint, with local payments and a free API key to begin.

Model id: nemotron
Context: 1M tokens
Plan: Basic (free) & up
Input: $0.09 / 1M tokens
Output: $0.45 / 1M tokens

Last updated June 5, 2026

Pricing

Per 1M tokens, billed from your credit balance — there is no markup on usage.

Direction    Price / 1M tokens
Input        $0.09
Output       $0.45

How billing works. The rates above are what usage costs against your prepaid credits on a paid plan. There is no per-token markup; Zylo's flat 25% platform fee applies only when you add credits. The free Basic plan instead gives a daily allowance of Basic-tier models (10 requests/min, no card, no credits). Nemotron 3 Super is a Basic-tier model, so it is callable on the free Basic plan within that daily allowance and the global 10 requests/min limit. Prices update live from our catalogue.
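To make the per-token arithmetic concrete, here is a minimal sketch that estimates what a single request would deduct from prepaid credits at the rates listed above. The function name and the example token counts are hypothetical. The 25% platform fee is left out on purpose: this page says it applies when you add credits but does not spell out how it is calculated.

```python
from decimal import Decimal

# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_RATE_PER_M = Decimal("0.09")
OUTPUT_RATE_PER_M = Decimal("0.45")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_usage_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request's token usage.

    Hypothetical helper: it covers usage only, not the fee on top-ups.
    """
    input_cost = INPUT_RATE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_RATE_PER_M * output_tokens / TOKENS_PER_M
    return input_cost + output_cost


# Made-up request size: 200,000 input tokens and 50,000 output tokens.
cost = estimate_usage_cost(200_000, 50_000)
print(f"Estimated usage cost: ${cost}")
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding drift. Since prices update live from the catalogue, treat the two constants as a snapshot.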

Quickstart

Already using the OpenAI SDK? Change two lines — base_url and your key — and set the model to nemotron.

Terminal
curl https://api.zyloai.net/v1/chat/completions \
  -H "Authorization: Bearer YOUR_ZYLO_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nemotron",
    "messages": [{"role": "user", "content": "Hello from Zylo!"}]
  }'

Python
# pip install openai
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_ZYLO_KEY",
    base_url="https://api.zyloai.net/v1",
)

response = client.chat.completions.create(
    model="nemotron",
    messages=[{"role": "user", "content": "Hello from Zylo!"}],
)
print(response.choices[0].message.content)

Node.js
// npm install openai
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "YOUR_ZYLO_KEY",
  baseURL: "https://api.zyloai.net/v1",
});

const response = await client.chat.completions.create({
  model: "nemotron",
  messages: [{ role: "user", content: "Hello from Zylo!" }],
});
console.log(response.choices[0].message.content);

Migrating?

On OpenRouter this model is nvidia/nemotron-3-super-120b-a12b; on Zylo, use the id nemotron.
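If existing code passes OpenRouter-style ids around, one way to migrate gradually is a small translation layer. The sketch below is illustrative only: the lookup table and both helper names are hypothetical, and the only mapping taken from this page is the one for Nemotron 3 Super.

```python
# Base URL as given on this page.
ZYLO_BASE_URL = "https://api.zyloai.net/v1"

# Hypothetical lookup table; only this one mapping comes from the page.
OPENROUTER_TO_ZYLO = {
    "nvidia/nemotron-3-super-120b-a12b": "nemotron",
}


def to_zylo_model(model_id: str) -> str:
    """Translate an OpenRouter model id to its Zylo id.

    Ids with no known mapping are returned unchanged, so ids that are
    already in Zylo form pass straight through.
    """
    return OPENROUTER_TO_ZYLO.get(model_id, model_id)


def zylo_request_kwargs(model_id: str, messages: list[dict]) -> dict:
    """Build keyword arguments for client.chat.completions.create."""
    return {"model": to_zylo_model(model_id), "messages": messages}


kwargs = zylo_request_kwargs(
    "nvidia/nemotron-3-super-120b-a12b",
    [{"role": "user", "content": "Hello from Zylo!"}],
)
print(kwargs["model"])
```

With a client created as in the Quickstart, the result can be passed as `client.chat.completions.create(**kwargs)`.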

Frequently asked questions

How much does Nemotron 3 Super cost on Zylo?

Nemotron 3 Super is billed at its base per-token rate: $0.09 per 1M input tokens and $0.45 per 1M output tokens, deducted from your prepaid credits. There is no markup on usage — Zylo's 25% platform fee applies only when you add credits.

Is Nemotron 3 Super available on the free Basic plan?

Yes. Nemotron 3 Super is a Basic-tier model, so you can call it on the free Basic plan within its daily usage allowance and a global limit of 10 requests per minute. The Basic plan does not include credits; paid plans (Go and up) add premium models and credits you spend on usage.
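To stay under the 10 requests per minute limit from the client side, a simple throttle can help. The following is a minimal sketch, assuming a sliding 60-second window; this page does not say how the server measures the window, and the class and its parameters are hypothetical. The clock and sleep functions are injectable so the logic can be exercised without real waiting.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side limiter: at most `max_calls` calls per `period` seconds."""

    def __init__(self, max_calls=10, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock
        self._sleep = sleep
        self._calls = deque()  # timestamps of recent calls, oldest first

    def wait_time(self) -> float:
        """Return seconds to wait before the next call is allowed."""
        now = self._clock()
        # Drop timestamps that have aged out of the window.
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            return 0.0
        return self.period - (now - self._calls[0])

    def acquire(self) -> None:
        """Block until a call is allowed, then record its timestamp."""
        while (delay := self.wait_time()) > 0:
            self._sleep(delay)
        self._calls.append(self._clock())


# Assumed limit from this page: 10 requests per 60 seconds.
limiter = SlidingWindowLimiter(max_calls=10, period=60.0)
```

Call `limiter.acquire()` immediately before each `client.chat.completions.create(...)` call. The server may still reject requests, for example once the daily allowance is used up, so this complements error handling rather than replacing it.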

What is the context window of Nemotron 3 Super?

Nemotron 3 Super supports up to 1M tokens of context through Zylo's OpenAI-compatible API.

Is Nemotron 3 Super OpenAI-compatible?

Yes. Point any OpenAI SDK at https://api.zyloai.net/v1, use your Zylo API key, and set the model to nemotron.

How do I switch Nemotron 3 Super from OpenRouter to Zylo?

On OpenRouter this model is nvidia/nemotron-3-super-120b-a12b. On Zylo, use the bare id nemotron with base URL https://api.zyloai.net/v1 — no vendor prefix.


Start calling Nemotron 3 Super in under 2 minutes

Create a free account and get an API key — no credit card required.
