NVIDIA NIM — Free LLM API
100+ open models from NVIDIA — no credit card required, ~40 RPM free tier. Get API key →
NVIDIA NIM (NVIDIA Inference Microservices) provides API access to 100+ open-weight models hosted on NVIDIA infrastructure. The free tier is available to all NVIDIA Developer Program members (free sign-up) with a limit of ~40 requests/minute. Models include Llama, Mistral, DeepSeek-R1, Nemotron, and domain-specific variants. All endpoints are OpenAI-compatible.
- 100+ open models available
- No daily token cap
- ~40 RPM free tier
- No credit card required
API Compatibility: OpenAI SDK-compatible (Chat Completions)
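Because the endpoints follow the OpenAI Chat Completions shape, a request can be built with nothing but the standard library. A minimal sketch — it assumes the `https://integrate.api.nvidia.com/v1` base URL from NVIDIA's docs, and `nvapi-...` is a placeholder for your real key:

```python
import json
import os
import urllib.request

BASE_URL = "https://integrate.api.nvidia.com/v1"

def build_request(model, messages, api_key):
    """Build an OpenAI-style POST /v1/chat/completions request for NVIDIA NIM."""
    body = json.dumps({"model": model, "messages": messages}).encode()
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",  # key from build.nvidia.com
            "Content-Type": "application/json",
        },
    )

if __name__ == "__main__":
    req = build_request(
        "meta/llama-3.1-405b-instruct",  # any model ID from the table above
        [{"role": "user", "content": "Hello"}],
        os.environ.get("NVIDIA_API_KEY", "nvapi-..."),  # placeholder key
    )
    # with urllib.request.urlopen(req) as resp:   # uncomment with a real key
    #     print(json.load(resp)["choices"][0]["message"]["content"])
```

The same payload works through the official OpenAI SDKs by pointing their `base_url` at the URL above.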
All Free NVIDIA NIM Models — Context Windows & Rate Limits
| Model | Context | Max Output | Rate Limit | Status |
|---|---|---|---|---|
| Various open models | 131K | 8K | See provider page | Details |
| deepseek-ai/deepseek-r1 | 128K | 163K | ~40 RPM | Details |
| nvidia/llama-3.1-nemotron-ultra-253b-v1 | 128K | 4K | ~40 RPM | Details |
| nvidia/nemotron-3-super-120b-a12b | 262K | 262K | ~40 RPM | Details |
| nvidia/nemotron-3-nano-30b-a3b | 128K | 32K | ~40 RPM | Details |
| meta/llama-3.1-405b-instruct | 128K | 4K | ~40 RPM | Details |
| qwen/qwen2.5-72b-instruct | 128K | 8K | ~40 RPM | Details |
| google/gemma-4-31b | 128K | 8K | ~40 RPM | Details |
| mistralai/mistral-large-2-instruct | 128K | 4K | ~40 RPM | Details |
| nvidia/nemotron-nano-2-vl | 128K | 8K | ~40 RPM | Details |
| minimax/minimax-m2.7 | 128K | 8K | ~40 RPM | Details |
| + 90 more models | 131K | 131K | ~40 RPM | Details |
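Rather than reading the table, the full catalog can be enumerated programmatically. A sketch assuming the standard OpenAI-compatible `GET /v1/models` route on `integrate.api.nvidia.com` (hypothetical helper name; `nvapi-...` is a placeholder key):

```python
import urllib.request

BASE_URL = "https://integrate.api.nvidia.com/v1"

def list_models_request(api_key):
    """Build a GET /v1/models request; the OpenAI-compatible models
    endpoint returns the IDs of every model the key can call."""
    return urllib.request.Request(
        f"{BASE_URL}/models",
        headers={"Authorization": f"Bearer {api_key}"},
    )

# With a real key, the response body is JSON with a "data" list:
# with urllib.request.urlopen(list_models_request("nvapi-...")) as resp:
#     import json
#     ids = [m["id"] for m in json.load(resp)["data"]]
```

Each returned ID (e.g. `deepseek-ai/deepseek-r1`) is what goes in the `model` field of a chat completion request.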
Frequently Asked Questions about NVIDIA NIM Free API
Is NVIDIA NIM free to use?
NVIDIA NIM offers a permanently free tier with 100+ available models. No credit card is required to get started — just join the NVIDIA Developer Program and generate an API key.
What models does NVIDIA NIM offer for free?
NVIDIA NIM provides 100+ free models covering chat and reasoning use cases. Supported modalities include text and image. Browse the full list above with context windows and rate limits.
How do I use NVIDIA NIM with Claude Code or Cursor?
Click "Details" on any model above to get one-click configuration snippets for Claude Code, Cursor, Codex, and more.
All NVIDIA NIM models listed here use an OpenAI-compatible endpoint, so any tool that accepts a custom baseURL will work.
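For tools that read their configuration from the environment, a sketch of that setup — `OPENAI_BASE_URL` and `OPENAI_API_KEY` are the variable names the official OpenAI SDKs read (other tools may use their own names), and the key value is a placeholder:

```shell
# Point any OpenAI-SDK-based tool at the NVIDIA NIM endpoint.
export OPENAI_BASE_URL="https://integrate.api.nvidia.com/v1"
export OPENAI_API_KEY="nvapi-your-key-here"   # placeholder; generate at build.nvidia.com
```

With these set, an unmodified OpenAI client in the same shell session will send its requests to NIM instead of api.openai.com.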