NVIDIA on Zylo

NVIDIA models & pricing

NVIDIA models Call them through one OpenAI-compatible API, billed at base per-token rates.

Last updated June 5, 2026

NVIDIA model pricing

Per 1M tokens, billed from your credit balance — no markup on usage. Prices load live from our catalogue.

ModelProviderContextInput / 1MOutput / 1M
Nemotron 3 SuperNVIDIA1M$0.09$0.45
Nemotron 3 Nano 30B A3BNVIDIA262K$0.05$0.20
How pricing works. The number is the base rate per 1M tokens, deducted from prepaid credits — no usage markup. Zylo's flat 25% platform fee applies only when you add credits.

Quickstart

Keep the OpenAI SDK; set base_url to Zylo and pick a NVIDIA model id.

Python
from openai import OpenAI

client = OpenAI(api_key="ZYLO_KEY", base_url="https://api.zyloai.net/v1")

response = client.chat.completions.create(
    model="nemotron",
    messages=[{"role": "user", "content": "Hello from Zylo!"}],
)
print(response.choices[0].message.content)

All NVIDIA models

Frequently asked questions

Which NVIDIA models can I use on Zylo?

Zylo currently offers 2 NVIDIA text models, including Nemotron 3 Super, Nemotron 3 Nano 30B A3B. See the table above for live per-1M-token pricing.

How much do NVIDIA models cost on Zylo?

Each NVIDIA model is billed at its base per-token rate, deducted from prepaid credits — no markup on usage. Zylo's 25% platform fee applies only when you add credits.

Are NVIDIA models OpenAI-compatible on Zylo?

Yes. Point any OpenAI SDK at https://api.zyloai.net/v1, use your Zylo key, and set the model id (e.g. nemotron).

Call NVIDIA models in under 2 minutes

Free API key, OpenAI-compatible, local payments.

Get free API key