NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 Token Price

Model ID: nvidia/llama-3.1-nemotron-ultra-253b-v1

Input Price: $0.60 per 1M tokens

Output Price: $1.80 per 1M tokens
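
As a rough illustration of how these per-million rates translate into the cost of a single request, the sketch below multiplies token counts by the listed prices. The helper name estimate_cost_usd and the token counts in the usage line are hypothetical and chosen only for illustration; the two rates are the ones listed above.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (taken from the prices above).
INPUT_PRICE_PER_1M = Decimal("0.60")
OUTPUT_PRICE_PER_1M = Decimal("1.80")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimate the USD cost of one request."""
    input_cost = INPUT_PRICE_PER_1M * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_PRICE_PER_1M * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


# Illustrative request: 2,000 prompt tokens and 500 completion tokens.
print(f"${estimate_cost_usd(2_000, 500):.4f}")
```

Decimal is used here instead of float so that small per-token amounts add up without binary rounding noise; for a quick estimate, plain floats would likely be fine as well.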

Integration Examples

Look up pricing for NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 from your application

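import requests

# Fetch pricing data from llmprices.ai
response = requests.get(
    "https://llmprices.ai/api/pricing?model=nvidia/llama-3.1-nemotron-ultra-253b-v1",
    timeout=10,  # avoid hanging indefinitely if the server does not respond
)
response.raise_for_status()  # fail early on HTTP errors

data = response.json()

# Prices are returned as per-token strings; scale them to USD per 1M tokens.
# Single quotes inside the f-strings keep this valid on Python versions before 3.12.
print(f"Model: {data['name']}")
print(f"Input: ${float(data['pricing']['prompt']) * 1_000_000:.2f}/1M tokens")
print(f"Output: ${float(data['pricing']['completion']) * 1_000_000:.2f}/1M tokens")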

Pricing API

Get real-time pricing data programmatically

Endpoint:

GET https://llmprices.ai/api/pricing?model=nvidia/llama-3.1-nemotron-ultra-253b-v1

Example Response:

{
  "id": "nvidia/llama-3.1-nemotron-ultra-253b-v1",
  "name": "NVIDIA: Llama 3.1 Nemotron Ultra 253B v1",
  "pricing": {
    "prompt": "0.0000006",
    "completion": "0.0000018"
  }
}
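
The response above can be converted into per-million rates without a network call, as in the minimal sketch below. It assumes that the "prompt" and "completion" strings are USD prices per single token, which is inferred from the example values (0.0000006 × 1,000,000 = 0.60, matching the listed input price) rather than stated by the API. The helper name prices_per_million is hypothetical.

```python
import json
from decimal import Decimal

# Example response body copied from the section above.
EXAMPLE_RESPONSE = """
{
  "id": "nvidia/llama-3.1-nemotron-ultra-253b-v1",
  "name": "NVIDIA: Llama 3.1 Nemotron Ultra 253B v1",
  "pricing": {
    "prompt": "0.0000006",
    "completion": "0.0000018"
  }
}
"""


def prices_per_million(payload: dict) -> dict:
    """Hypothetical helper: scale per-token price strings to USD per 1M tokens.

    Assumes the strings are per-token USD prices (an inference from the example).
    """
    pricing = payload["pricing"]
    return {
        kind: Decimal(pricing[kind]) * 1_000_000
        for kind in ("prompt", "completion")
    }


payload = json.loads(EXAMPLE_RESPONSE)
rates = prices_per_million(payload)
print(f"{payload['name']}")
print(f"Input: ${rates['prompt']:.2f}/1M tokens")
print(f"Output: ${rates['completion']:.2f}/1M tokens")
```

Parsing the strings with Decimal rather than float keeps the scaled values exact, which may matter if the rates are later multiplied by large token counts.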