17 models available
Nebius Token Factory provides OpenAI-compatible inference APIs for open-weight language, vision, and embedding models across Nebius-hosted regions.
Gemma 2 2B IT
$0.02 / $0.06
Gemma 3 27B IT
$0.10 / $0.30
GPT-OSS 120B
$0.15 / $0.60
Hermes 4 405B
$1.00 / $3.00
Hermes 4 70B
$0.13 / $0.40
INTELLECT-3
$0.20 / $1.10
Llama 3.1 8B Instruct
$0.02 / $0.06
Llama 3.1 Nemotron Ultra 253B
$0.60 / $1.80
Llama 3.3 70B Instruct
$0.13 / $0.40
Nemotron 3 Nano 30B A3B
$0.06 / $0.24
Nemotron 3 Nano Omni
$0.06 / $0.24
Qwen 2.5 VL 72B Instruct
$0.25 / $0.75
Qwen3 235B A22B Instruct
$0.20 / $0.60
Qwen3 30B A3B Instruct
$0.10 / $0.30
Qwen3 32B
$0.10 / $0.30
Qwen3 Embedding 8B
$0.01 / —
Qwen3 Next 80B A3B Thinking
$0.15 / $1.20
You need AI that won’t create compliance headaches. Your data stays in the EU, GDPR is enforced by default, and every request is routed for the best balance of cost, latency, and uptime, reducing risk while improving performance.