
Llama 3.1 Nemotron Ultra 253B

by nvidia

Chat

Description

NVIDIA-tuned Llama variant built for high-efficiency reasoning, safety, and enterprise-grade performance.

Providers

Pricing

Input: $0.60 / 1M tokens
Output: $1.80 / 1M tokens
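
As a rough worked example of the rates above, the sketch below estimates the cost of a single request. The function name `estimateCostUsd` and the token counts are illustrative assumptions; only the two per-million-token prices come from the listing.

```javascript
// Prices taken from the listing above, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.60;
const OUTPUT_USD_PER_MILLION = 1.80;

// Hypothetical helper: estimates the cost of one request in USD.
function estimateCostUsd(inputTokens, outputTokens) {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: 10,000 prompt tokens and 2,000 completion tokens.
// 0.01 * $0.60 + 0.002 * $1.80 = $0.006 + $0.0036 = $0.0096
console.log(estimateCostUsd(10_000, 2_000).toFixed(4)); // prints "0.0096"
```

Output tokens cost three times as much as input tokens here, so long completions tend to dominate the bill more than long prompts do.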

Specifications

Context: 128K tokens
Parameters
License
Released

Capabilities

Tools, JSON Mode, Streaming
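
On OpenAI-compatible chat APIs, these three capabilities are usually switched on through request-body fields. The sketch below assumes this model's endpoint accepts the standard OpenAI field names (`tools`, `response_format`, `stream`); the listing does not confirm this. `buildRequestBody` and the `get_weather` tool are hypothetical names made up for the example.

```javascript
// Hypothetical helper: builds a chat-completions request body.
// Field names follow the OpenAI Chat Completions convention; whether the
// endpoint honours each of them is an assumption.
function buildRequestBody(messages, { tools, jsonMode = false, stream = false } = {}) {
  const body = { model: "nemotron-ultra-253b", messages };
  if (tools && tools.length > 0) body.tools = tools;             // Tools
  if (jsonMode) body.response_format = { type: "json_object" };  // JSON Mode
  if (stream) body.stream = true;                                // Streaming
  return body;
}

// Illustrative tool definition (invented for this example).
const weatherTool = {
  type: "function",
  function: {
    name: "get_weather",
    description: "Look up the current weather for a city.",
    parameters: {
      type: "object",
      properties: { city: { type: "string" } },
      required: ["city"],
    },
  },
};

const body = buildRequestBody(
  [{ role: "user", content: "What is the weather in Berlin?" }],
  { tools: [weatherTool] }
);
console.log(JSON.stringify(body, null, 2));
```

Leaving optional fields out of the body unless they are requested keeps the default request identical to a plain chat call.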

EU-Compliant API Access

Use Llama 3.1 Nemotron Ultra 253B with a single API call. The endpoint is OpenAI-compatible, and EU data residency is guaranteed.

JavaScript
const response = await fetch("https://api.eurouter.ai/v1/chat/completions", {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    "Authorization": `Bearer ${process.env.EUROUTER_API_KEY}`,
  },
  body: JSON.stringify({
    model: "nemotron-ultra-253b",
    messages: [
      { role: "user", content: "Hello!" }
    ],
  }),
});

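// Fail loudly on HTTP errors instead of reading an error payload as a completion.
if (!response.ok) {
  throw new Error(`Request failed: HTTP ${response.status}`);
}

const data = await response.json();
console.log(data.choices[0].message.content);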
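
Since Streaming is listed among the capabilities, a streamed variant of the call might look like the sketch below. It assumes the endpoint follows the OpenAI server-sent-events format (`data: {...}` lines, terminated by `data: [DONE]`) when `stream: true` is set, which the page does not spell out. The helper names `parseSseLine` and `streamChat` are hypothetical.

```javascript
// Hypothetical helper: extracts the text delta from one SSE line.
// Returns null for blank lines, comments, the [DONE] sentinel, and chunks without text.
function parseSseLine(line) {
  const trimmed = line.trim();
  if (!trimmed.startsWith("data:")) return null;
  const payload = trimmed.slice("data:".length).trim();
  if (payload === "" || payload === "[DONE]") return null;
  const chunk = JSON.parse(payload);
  return chunk.choices?.[0]?.delta?.content ?? null;
}

// Sketch of a streaming call. Written for Node 18+, where fetch is global
// and response.body can be consumed with for await.
async function streamChat(prompt, onText) {
  const response = await fetch("https://api.eurouter.ai/v1/chat/completions", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      "Authorization": `Bearer ${process.env.EUROUTER_API_KEY}`,
    },
    body: JSON.stringify({
      model: "nemotron-ultra-253b",
      messages: [{ role: "user", content: prompt }],
      stream: true, // assumed to behave as in the OpenAI API
    }),
  });
  if (!response.ok) throw new Error(`Request failed: HTTP ${response.status}`);

  const decoder = new TextDecoder();
  let buffer = "";
  for await (const bytes of response.body) {
    buffer += decoder.decode(bytes, { stream: true });
    const lines = buffer.split("\n");
    buffer = lines.pop(); // keep a possibly incomplete last line for the next chunk
    for (const line of lines) {
      const text = parseSseLine(line);
      if (text !== null) onText(text);
    }
  }
  // Handle a final line that arrived without a trailing newline.
  const tail = parseSseLine(buffer);
  if (tail !== null) onText(tail);
}
```

A call such as `streamChat("Hello!", (text) => process.stdout.write(text))` would then print tokens as they arrive. Buffering by line matters because a network chunk can end in the middle of a `data:` line.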

Integrate AI without GDPR risk.

You need AI that won't create compliance headaches. Your data stays in the EU, GDPR is enforced by default, and every request is routed for the best balance of cost, latency, and uptime.

GDPR by default
EU data residency
Smart routing