DeepSeek-R1-Distill-Llama-70B — Free API

Created by DeepSeek
nscale/deepseek-r1-distill-llama-70b
chat reasoning

What is DeepSeek-R1-Distill-Llama-70B?

DeepSeek R1 Distill Llama 70B on OVHcloud AI Endpoints combines DeepSeek's reasoning distillation with Llama's 70B architecture, delivered from OVHcloud's European infrastructure. It supports chain-of-thought reasoning, a 131K-token context window, and up to 32K output tokens, which makes it well suited to complex analytical tasks that benefit from step-by-step deliberation. The API is OpenAI-compatible, and registration is required. It is a good choice for EU developers who need reasoning capabilities with GDPR-compliant hosting.
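
Models in the R1 distill family typically write their chain of thought inside <think>...</think> tags before the final answer. The sketch below separates the two, assuming the endpoint returns those tags inline in the message content (some providers may expose the reasoning in a separate field instead). The helper name split_reasoning is hypothetical and not part of any SDK.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think> inside the content.
# split_reasoning is an illustrative helper, not a library function.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(content: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from a raw completion string."""
    match = _THINK_RE.search(content)
    if match is None:
        # No reasoning block found: treat the whole string as the answer.
        return "", content.strip()
    reasoning = match.group(1).strip()
    # The answer is everything outside the first <think> block.
    answer = (content[:match.start()] + content[match.end():]).strip()
    return reasoning, answer
```

Passing response.choices[0].message.content through this helper lets an application log the reasoning separately and show only the answer to the user.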

Model ID
deepseek-r1-distill-llama-70b
Base URL
https://inference.api.nscale.com/v1

DeepSeek-R1-Distill-Llama-70B API Code Example

Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.

Python

from openai import OpenAI

client = OpenAI(
    base_url="https://inference.api.nscale.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="deepseek-r1-distill-llama-70b",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

JavaScript

import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://inference.api.nscale.com/v1",
  apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
  model: "deepseek-r1-distill-llama-70b",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

cURL

curl https://inference.api.nscale.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "deepseek-r1-distill-llama-70b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
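
For environments where installing the openai package is not an option, the same request can be sketched with only the Python standard library. This mirrors the cURL call above. The environment variable name NSCALE_API_KEY and the helper names build_chat_request and send_chat are assumptions made for illustration, and the response shape is assumed to follow the OpenAI-compatible format used in the examples above.

```python
import json
import os
import urllib.request

BASE_URL = "https://inference.api.nscale.com/v1"
MODEL_ID = "deepseek-r1-distill-llama-70b"


def build_chat_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send_chat(prompt: str) -> str:
    """Send the request and return the assistant's reply text."""
    # Hypothetical variable name; keeps the key out of source code.
    api_key = os.environ["NSCALE_API_KEY"]
    request = build_chat_request(prompt, api_key)
    with urllib.request.urlopen(request, timeout=60) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

With the key exported in the shell, print(send_chat("Hello!")) would send the same request as the examples above. Reading the key from the environment avoids committing it to version control.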

More About Nscale

How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Nscale as a free LLM API backend.

View Nscale full guide →