Free LLM API Directory (2026): Browse 312+ Models

Discover and filter 312+ free LLM models across 30 providers. Find APIs by capability (vision, reasoning), rate limits, or no-credit-card requirements, and get the perfect free AI model for your project.

205 models verified via live API · refreshed Jun 30, 2026 — how we verify

Provider	Model	Score	Context	Modality	Rate Limit	Released	Weekly Tokens	Status
OpenRouter	MiniMax: MiniMax M3 Paid MiniMax: MiniMax M3	93	1.0M	text👁️ imagevideo🧠 reasoning	200 req/day (free tier)	Jun 1, 2026	3.7T	Online
OpenRouter	Nex AGI: Nex-N2-Pro Paid Nex AGI: Nex-N2-Pro	91	262K	text👁️ image	200 req/day (free tier)	Jun 2, 2026	434.3M	Online
Google Gemini	Gemini 3.5 Flash Gemini 3.5 Flash	90	1.0M	text👁️ imagevideoaudiopdf🧠 reasoning	15 RPM, 1,500 RPD	May 19, 2026	—	Online
NVIDIA NIM	minimaxai/minimax-m3 Verified MiniMax: MiniMax M3	89	1.0M	text👁️ imagevideo🧠 reasoning	Up to 40 RPM	Jun 1, 2026	—	Online
OpenRouter	DeepSeek: DeepSeek V4 Flash Paid DeepSeek: DeepSeek V4 Flash	88	1.0M	text🧠 reasoning	200 req/day (free tier)	Apr 24, 2026	4.7T	Online
OpenRouter	NVIDIA: Nemotron 3 Ultra (free) Verified NVIDIA: Nemotron 3 Ultra (free)	88	1.0M	text🧠 reasoning	200 req/day (free tier)	Jun 4, 2026	777.1B	Online
OpenRouter	MoonshotAI: Kimi K2.6 Paid MoonshotAI: Kimi K2.6	87	262K	text👁️ imagevideo🧠 reasoning	200 req/day (free tier)	Apr 20, 2026	2.4B	Online
NVIDIA NIM	deepseek-ai/deepseek-v4-pro Verified deepseek-ai/deepseek-v4-pro	86	1.0M	text🧠 reasoning	Up to 40 RPM	Apr 24, 2026	—	Online
OpenRouter	Z.ai: GLM 5.1 Paid	85	203K	text	200 req/day (free tier)	Apr 7, 2026	158.1B	Online
NVIDIA NIM	deepseek-ai/deepseek-v4-flash Verified DeepSeek: DeepSeek V4 Flash	83	1.0M	text🧠 reasoning	Up to 40 RPM	Apr 24, 2026	—	Online
NVIDIA NIM	moonshotai/kimi-k2.6 Verified MoonshotAI: Kimi K2.6	82	262K	text👁️ imagevideo🧠 reasoning	Up to 40 RPM	Apr 20, 2026	2.4B	Online
Agnes AI	agnes-2.0-flash Verified	81	256K	text👁️ vision	30 RPM	Jun 30, 2026	—	Online
OVHcloud AI Endpoints	Qwen3.6-27B Qwen3.6-27B	81	131K	text👁️ imagevideoaudio🧠 reasoning	2 RPM (anonymous)	Apr 22, 2026	—	Online
NVIDIA NIM	z-ai/glm-5.1 Verified	80	203K	text	Up to 40 RPM	Apr 7, 2026	158.1B	Online
OpenRouter	inclusionAI: Ring-2.6-1T Paid	80	262K	text	200 req/day (free tier)	May 8, 2026	6.0B	Online
OpenRouter	Cohere: North Mini Code (free) Verified Cohere: North Mini Code (free)	79	256K	textcode	200 req/day (free tier)	Jun 9, 2026	121.2B	Online
OpenRouter	Nemotron 3 Ultra 550B A55B Verified NVIDIA: Nemotron 3 Ultra (free)	77	1.0M	🧠 reasoning	200 req/day (free tier)	Jun 4, 2026	—	Online
NVIDIA NIM	minimaxai/minimax-m2.7 Verified MiniMax: MiniMax M3	76	205K	text🧠 reasoning	Up to 40 RPM	Mar 18, 2026	—	Online
NVIDIA NIM	stepfun-ai/step-3.7-flash Verified	76	256K	text👁️ image🧠 reasoning	Up to 40 RPM	May 29, 2026	—	Online
ModelScope	Qwen/Qwen3.5-27B Qwen/Qwen3.5-27B	76	131K	text👁️ imagevideoaudio🧠 reasoning	2,000 RPD total; <=500 RPD/model (dynamic)	Feb 24, 2026	—	Online
OpenRouter	Google: Gemma 4 31B (free) Verified Google: Gemma 4 31B (free)	76	262K	text👁️ image🧠 reasoning	200 req/day (free tier)	Apr 2, 2026	37.2B	Online
OpenRouter	MiniMax: MiniMax M2.5 Paid MiniMax: MiniMax M3	75	205K	text🧠 reasoning	200 req/day (free tier)	Feb 12, 2026	4.3B	Online
ModelScope	deepseek-ai/DeepSeek-V4-Pro Verified deepseek-ai/deepseek-v4-pro	73	8K			Apr 24, 2026	—	Online
OpenRouter	Google: Gemma 4 26B A4B (free) Verified Google: Gemma 4 26B A4B (free)	73	262K	text👁️ image🧠 reasoning	200 req/day (free tier)	Apr 2, 2026	4.0B	Online
OVHcloud AI Endpoints	Qwen3.5-9B Qwen3.5-9B	73	131K	text🧠 reasoning	2 RPM (anonymous)	Mar 2, 2026	—	Online
SambaNova	MiniMax-M2.7 MiniMax-M2.7	73	128K	text🧠 reasoning	20 RPM, 20 RPD, 200K TPD	Mar 18, 2026	—	Online
Kilo Code	minimax/minimax-m2.5:free MiniMax: MiniMax M3	72	196K	text🧠 reasoning	~200 req/hr	Feb 12, 2026	—	Online
OpenRouter	Arcee AI: Trinity Large Thinking Paid	72	262K	text🧠 reasoning	200 req/day (free tier)	Apr 1, 2026	1.7B	Online
OVHcloud AI Endpoints	Qwen3.5-397B-A17B Qwen3.5-397B-A17B	72	131K	text👁️ imagevideoaudio🧠 reasoning	2 RPM (anonymous)	Feb 16, 2026	—	Online
NVIDIA NIM	qwen/qwen3.5-397b-a17b Verified Qwen3.5-397B-A17B	71	256K	text👁️ imagevideoaudio🧠 reasoning	Up to 40 RPM	Feb 16, 2026	30.1B	Online
ModelScope	Qwen/Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B	71	131K	text👁️ imagevideoaudio🧠 reasoning	2,000 RPD total; <=500 RPD/model (dynamic)	Feb 24, 2026	—	Online
OpenRouter	NVIDIA: Nemotron 3 Nano Omni (free) Verified NVIDIA: Nemotron 3 Nano Omni (free)	71	256K	text👁️ imageaudiovideo🧠 reasoning	200 req/day (free tier)	Apr 28, 2026	18.1B	Online
OpenRouter	NVIDIA: Nemotron 3 Super (free) Verified NVIDIA: Nemotron 3 Super (free)	71	1.0M	text🧠 reasoning	200 req/day (free tier)	Mar 11, 2026	404.8B	Online
NVIDIA NIM	qwen/qwen3.5-122b-a10b Verified qwen/qwen3.5-122b-a10b	71	262K	text👁️ imagevideoaudio🧠 reasoning	Up to 40 RPM	Feb 24, 2026	6.1B	Online
ModelScope	deepseek-ai/DeepSeek-V4-Flash Verified DeepSeek: DeepSeek V4 Flash	70	8K			Apr 24, 2026	—	Online
Kilo Code	arcee-ai/trinity-large-thinking:free	69	131K	text🧠 reasoning	~200 req/hr	Apr 1, 2026	—	Online
OpenRouter	Poolside: Laguna XS.2 (free) Verified	68	262K	text	200 req/day (free tier)	Apr 28, 2026	107.7B	Online
OpenRouter	Poolside: Laguna M.1 (free) Verified	68	262K	text	200 req/day (free tier)	Apr 28, 2026	679.2B	Online
Kilo Code	nvidia/nemotron-3-super-120b-a12b:free NVIDIA: Nemotron 3 Super (free)	68	262K	text🧠 reasoning	~200 req/hr	Mar 11, 2026	—	Online
ModelScope	Qwen/Qwen3.5-27B Verified Qwen/Qwen3.5-27B	67	8K			Feb 25, 2026	—	Online
Cloudflare Workers AI	@cf/google/gemma-4-26b-a4b-it Google: Gemma 4 26B A4B (free)	66	256K	text👁️ image🧠 reasoning	10K neurons/day (shared)	Apr 2, 2026	—	Online
Ollama Cloud	deepseek-v3.1:671b-cloud	65	128K	text	Session/weekly limits (unpublished)	May 23, 2026	—	Online
OpenRouter	Gemma 4 31B IT Verified Google: Gemma 4 31B (free)	65	262K	👁️ vision🧠 reasoning	200 req/day (free tier)	Apr 2, 2026	—	Online
Cloudflare Workers AI	@cf/moonshotai/kimi-k2.7-code MoonshotAI: Kimi K2.6	64	262K	textcode👁️ imagevideo🧠 reasoning	10K neurons/day (shared)	Jun 12, 2026	—	Online
NVIDIA NIM	stepfun-ai/step-3.5-flash Verified	64	262K	text🧠 reasoning	Up to 40 RPM	Feb 2, 2026	—	Online
Z AI (Zhipu AI)	GLM-4.7-Flash GLM-4.7-Flash	64	200K	text🧠 reasoning	1 concurrent request	Jan 19, 2026	—	Online
OpenRouter	Gemma 4 26B A4B IT Verified Google: Gemma 4 26B A4B (free)	63	262K	👁️ vision🧠 reasoning	200 req/day (free tier)	Apr 3, 2026	—	Online
Z AI (Zhipu AI)	GLM-4.6V-Flash GLM-4.6V-Flash	63	128K	text👁️ imagevideo🧠 reasoning	1 concurrent request	Dec 8, 2025	—	Online
ModelScope	Qwen/Qwen3.5-35B-A3B Verified Qwen/Qwen3.5-35B-A3B	62	8K			Feb 25, 2026	—	Online
Google Gemini	Gemini 3.1 Flash-Lite Gemini 3.1 Flash-Lite	62	1.0M	text👁️ imagevideoaudiopdf🧠 reasoning	30 RPM, 1,500 RPD	Mar 3, 2026	—	Online
GitHub Models	o4-mini o4-mini	62	200K	text👁️ image🧠 reasoning	10 RPM, 50 RPD	Apr 16, 2025	—	Online
Mistral AI	Mistral Small 4	61	256K	text	~1 RPS, 500K TPM	Mar 16, 2026	—	Online
Kilo Code	bytedance-seed/dola-seed-2.0-pro:free	61	131K	text	~200 req/hr	May 10, 2026	—	Online
Google Gemini	Gemma 4 31B IT Verified Google: Gemma 4 31B (free)	61	262K	👁️ vision🧠 reasoning		Apr 2, 2026	—	Online
ModelScope	deepseek-ai/DeepSeek-V3.2 Verified deepseek-ai/DeepSeek-V3.2	61	8K			Dec 1, 2025	—	Online
Ollama Cloud	deepseek-r1:cloud deepseek-r1:cloud	61	128K	text🧠 reasoning	Session/weekly limits (unpublished)	Jan 20, 2025	—	Online
Ollama Cloud	qwen3-coder:480b-cloud	60	128K	textcode	Session/weekly limits (unpublished)	May 23, 2026	—	Online
OpenRouter	Nemotron 3 Super 120B A12B Verified NVIDIA: Nemotron 3 Super (free)	60	262K	🧠 reasoning	200 req/day (free tier)	Mar 11, 2026	—	Online
Cerebras	zai-glm-4.7 zai-glm-4.7	60	128K	text🧠 reasoning	10 RPM, 100 RPD, 1M TPD	Dec 22, 2025	—	Online
SambaNova	DeepSeek-V3.2 (Preview)	59	128K	text	20 RPM, 20 RPD, 200K TPD	Jun 17, 2026	—	Online
OpenRouter	Nous: Hermes 3 405B Instruct (free) Verified	59	131K	text	200 req/day (free tier)	Aug 16, 2024	53.6M	Online
GitHub Models	AI21 Jamba 1.5 Large	59	256K	text	See provider page	Jun 27, 2026	—	Online
Google Gemini	Gemma 4 26B A4B IT Verified Google: Gemma 4 26B A4B (free)	59	262K	👁️ vision🧠 reasoning		Apr 3, 2026	—	Online
ModelScope	Qwen/Qwen3.5-397B-A17B Verified Qwen3.5-397B-A17B	59	8K			Feb 16, 2026	—	Online
NVIDIA NIM	Nemotron 3 Nano Omni 30B A3B Reasoning Verified NVIDIA: Nemotron 3 Nano Omni (free)	59	256K	👁️ visionaudio🧠 reasoning		Apr 28, 2026	—	Online
ModelScope	Qwen/Qwen3.5-122B-A10B Verified qwen/qwen3.5-122b-a10b	59	8K			Feb 25, 2026	—	Online
OpenRouter	Qwen: Qwen3 Coder 480B A35B (free) Verified Qwen: Qwen3 Coder 480B A35B (free)	59	1.0M	textcode	200 req/day (free tier)	Jul 23, 2025	0	Online
GitHub Models	gpt-4.1 gpt-4.1	58	1.0M	text👁️ imagepdf	10 RPM, 50 RPD	Apr 14, 2025	—	Online
OpenRouter	OpenAI: gpt-oss-120b (free) Verified OpenAI: gpt-oss-120b (free)	58	131K	text🧠 reasoning	200 req/day (free tier)	Aug 5, 2025	132.8B	Online
OpenRouter	NVIDIA: Nemotron 3.5 Content Safety (free) Verified NVIDIA: Nemotron 3.5 Content Safety (free)	58	128K	text👁️ image🧠 reasoning	200 req/day (free tier)	Jun 4, 2026	1.1B	Online
SambaNova	gemma-4-31B-it (Preview) gemma-4-31B-it (Preview)	57	128K	text👁️ image🧠 reasoning	20 RPM, 20 RPD, 200K TPD	Apr 2, 2026	—	Online
Cloudflare Workers AI	@cf/openai/gpt-oss-120b	56	128K	text	10K neurons/day (shared)	Jun 17, 2026	—	Online
Kilo Code	x-ai/grok-code-fast-1:free	56	256K	textcode	~200 req/hr	Aug 28, 2025	—	Online
Cloudflare Workers AI	@cf/google/gemma-4-26b-a4b-it Verified Google: Gemma 4 26B A4B (free)	56	8K			Apr 3, 2026	—	Online
GitHub Models	gpt-4.1-mini gpt-4.1-mini	56	1.0M	text👁️ imagepdf	15 RPM, 150 RPD	Apr 14, 2025	—	Online
LLM7.io	deepseek-r1-0528 deepseek-r1-0528	56	131K	text🧠 reasoning	30 RPM (120 with token)	May 28, 2025	—	Online
OpenRouter	Qwen: Qwen3 Next 80B A3B Instruct (free) Verified Qwen: Qwen3 Next 80B A3B Instruct (free)	56	262K	text	200 req/day (free tier)	Sep 11, 2025	274.9M	Online
GitHub Models	gpt-5 gpt-5	56	200K	text👁️ image🧠 reasoning	10 RPM, 50 RPD	Aug 7, 2025	—	Online
Cloudflare Workers AI	@cf/zhipuai/glm-4.7-flash @cf/zhipuai/glm-4.7-flash	56	131K	text🧠 reasoning	10K neurons/day (shared)	Jan 19, 2026	—	Online
Ollama Cloud	gpt-oss:120b-cloud gpt-oss:120b-cloud	56	128K	text🧠 reasoning	Session/weekly limits (unpublished)	Aug 5, 2025	—	Online
OpenRouter	Google: Lyria 3 Pro Preview Verified Google: Lyria 3 Pro Preview	56	1.0M	text👁️ image	200 req/day (free tier)	Mar 30, 2026	6.0M	Online
OpenRouter	Google: Lyria 3 Clip Preview Verified Google: Lyria 3 Clip Preview	56	1.0M	text👁️ image	200 req/day (free tier)	Mar 30, 2026	4.9M	Online
Agnes AI	agnes-1.5-flash Verified	55	256K	text👁️ vision	30 RPM	Jun 30, 2026	—	Online
OVHcloud AI Endpoints	Qwen3-Coder-30B-A3B-Instruct Qwen3-Coder-30B-A3B-Instruct	55	262K	textcode	2 RPM (anonymous)	Jul 31, 2025	—	Online
Ollama Cloud	kimi-k2:1t-cloud	54	262K	text	Session/weekly limits (unpublished)	May 23, 2026	—	Online
Google Gemini	Gemini 3.1 Flash Lite Verified Gemini 3.1 Flash-Lite	54	1.0M	👁️ visionaudio🧠 reasoning		May 7, 2026	—	Online
Google Gemini	Gemini 3.1 Flash Lite Verified Gemini 3.1 Flash-Lite	54	1.0M	👁️ visionaudio🧠 reasoning		May 7, 2026	—	Online
Cerebras	gpt-oss-120b gpt-oss:120b-cloud	54	128K	text🧠 reasoning	30 RPM, 14,400 RPD, 1M TPD	Aug 5, 2025	—	Online
OpenRouter	NVIDIA: Nemotron 3 Nano 30B A3B (free) Verified NVIDIA: Nemotron 3 Nano 30B A3B (free)	54	256K	text🧠 reasoning	200 req/day (free tier)	Dec 14, 2025	34.2B	Online
OpenRouter	Z.ai: GLM 4.5 Air Paid Z.ai: GLM 4.5 Air	54	131K	text🧠 reasoning	200 req/day (free tier)	Jul 28, 2025	1.2B	Online
NVIDIA NIM	mistralai/mistral-large-3-675b-instruct-2512 Verified	53	8K			Jun 29, 2026	—	Online
OpenRouter	GPT OSS 120B Verified	53	131K	🧠 reasoning	200 req/day (free tier)	Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/nemotron-3.5-content-safety Verified NVIDIA: Nemotron 3.5 Content Safety (free)	53	128K	text👁️ image🧠 reasoning	Up to 40 RPM	Jun 4, 2026	1.1B	Online
OpenRouter	NVIDIA: Nemotron Nano 12B 2 VL (free) Verified NVIDIA: Nemotron Nano 12B 2 VL (free)	53	128K	text👁️ imagevideo🧠 reasoning	200 req/day (free tier)	Oct 28, 2025	8.1B	Online
SambaNova	DeepSeek-V3.1 DeepSeek-V3.1	53	128K	text	20 RPM, 20 RPD, 200K TPD	Aug 21, 2025	—	Online
Mistral AI	Mistral Medium 3.5 (128B)	52	256K	text	~1 RPS, 500K TPM	Jun 17, 2026	—	Online
Groq	GPT-OSS 120B Paid	52	131K	text	See provider page	Aug 5, 2025	132.8B	Online
NVIDIA NIM	nvidia/nemoretriever-parse Verified	52	131K	rerank	Up to 40 RPM	Jun 17, 2026	—	Online
OpenRouter	OpenAI: gpt-oss-20b (free) Verified	52	131K	text	200 req/day (free tier)	Aug 5, 2025	26.0B	Online
Agnes AI	agnes-image-2.0-flash Verified	52	4K	👁️ image	30 RPM (1K)	Jun 30, 2026	—	Online
Agnes AI	agnes-image-2.1-flash Verified	52	4K	👁️ image	30 RPM (1K)	Jun 30, 2026	—	Online
Ollama Cloud	glm-4.6:cloud zai-glm-4.7	52	128K	text🧠 reasoning	Session/weekly limits (unpublished)	Sep 30, 2025	—	Online
Google Gemini	Gemini 2.5 Flash Gemini 2.5 Flash	52	1.0M	text👁️ imageaudiovideopdf🧠 reasoning	15 RPM, 1,500 RPD	May 20, 2025	—	Online
OpenRouter	Venice: Uncensored (free) Verified	51	33K	text	200 req/day (free tier)	Jul 9, 2025	—	Online
Cloudflare Workers AI	@cf/meta/llama-4-scout-17b-16e-instruct	51	10.0M	text	10K neurons/day (shared)	May 10, 2026	—	Online
OpenRouter	nousresearch/hermes-3-llama-3.1-405b Verified	51	8K		200 req/day (free tier)	Jun 29, 2026	—	Online
Google Gemini	Gemini 2.5 Pro Gemini 2.5 Pro	51	2.0M	text👁️ imageaudiovideopdf🧠 reasoning	5 RPM, 50 RPD	Jun 5, 2025	—	Online
OpenRouter	NVIDIA: Nemotron Nano 9B V2 (free) Verified NVIDIA: Nemotron Nano 9B V2 (free)	51	128K	text🧠 reasoning	200 req/day (free tier)	Sep 5, 2025	12.1B	Online
GitHub Models	Llama-4-Scout-17B-16E Llama-4-Scout-17B-16E	51	512K	text👁️ image	15 RPM, 150 RPD	Apr 5, 2025	—	Online
GitHub Models	Llama-4-Maverick-17B-128E Llama-4-Scout-17B-16E	51	256K	text👁️ image	10 RPM, 50 RPD	Apr 5, 2025	—	Online
GitHub Models	gpt-4o gpt-4o	51	128K	text👁️ imagepdf	10 RPM, 50 RPD	May 13, 2024	—	Online
GitHub Models	Mistral-Small-3.1	50	128K	text	15 RPM, 150 RPD	Mar 17, 2025	—	Online
OpenRouter	LiquidAI: LFM2.5-1.2B-Thinking (free) Verified	50	33K	text🧠 reasoning	200 req/day (free tier)	Jan 20, 2026	2.2B	Online
LLM7.io	gemini-2.5-flash-lite gemini-2.5-flash-lite	50	131K	text👁️ imageaudiovideopdf🧠 reasoning	30 RPM (120 with token)	Jun 17, 2025	—	Online
Aion Labs	Aion 2.0 Aion 2.0	50	128K	text	15 RPM, 20K TPD	Feb 23, 2026	—	Online
OVHcloud AI Endpoints	Qwen2.5-VL-72B-Instruct Qwen2.5-VL-72B-Instruct	50	128K	text👁️ image	2 RPM (anonymous)	Sep 1, 2024	—	Online
OVHcloud AI Endpoints	Mistral-Small-3.2-24B-Instruct Mistral-Small-3.2-24B-Instruct	50	128K	text	2 RPM (anonymous)	Jun 20, 2025	—	Online
OVHcloud AI Endpoints	Mistral-Nemo-Instruct-2407 Mistral-Nemo-Instruct-2407	50	128K	text	2 RPM (anonymous)	Jul 1, 2024	—	Online
GitHub Models	DeepSeek-R1 DeepSeek-R1	50	64K	text🧠 reasoning	15 RPM, 150 RPD	May 28, 2025	—	Online
Mistral AI	Codestral	49	256K	textcode	~1 RPS, 500K TPM	May 10, 2026	—	Online
Mistral AI	Mistral Large 3	49	256K	text	~1 RPS, 500K TPM	Dec 2, 2025	—	Online
NVIDIA NIM	abacusai/dracarys-llama-3.1-70b-instruct Verified	49	8K			Jun 29, 2026	—	Online
Agnes AI	agnes-video-v2.0 Verified	49	4K	video	2 RPM	Jun 30, 2026	—	Online
Groq	llama-4-scout-17b-16e-instruct Llama-4-Scout-17B-16E	49	131K	text👁️ image	30 RPM, 1,000 RPD	Apr 5, 2025	—	Online
OpenRouter	Meta: Llama 3.3 70B Instruct (free) Verified Meta: Llama 3.3 70B Instruct (free)	49	131K	text	200 req/day (free tier)	Dec 6, 2024	214.6M	Online
OVHcloud AI Endpoints	gpt-oss-20b gpt-oss-20b	49	128K	text	2 RPM (anonymous)	Aug 5, 2025	—	Online
LLM7.io	mistral-small-3.1-24b mistral-small-3.1-24b	49	32K	text	30 RPM (120 with token)	Mar 17, 2025	—	Online
OpenRouter	LiquidAI: LFM2.5-1.2B-Instruct (free) Verified	48	33K	text	200 req/day (free tier)	Jan 5, 2026	1.1B	Online
OpenRouter	Free Models Router Verified	48	200K	text👁️ image	200 req/day (free tier)	Feb 1, 2026	—	Online
Mistral AI	Mixtral 8x7B	48	33K	text	See provider page	Jun 27, 2026	—	Online
LLM7.io	deepseek-v3-0324 deepseek-v3-0324	48	131K	text	30 RPM (120 with token)	Mar 25, 2025	—	Online
NVIDIA NIM	nvidia/llama-3.3-nemotron-super-49b-v1.5 Verified nvidia/llama-3.3-nemotron-super-49b-v1.5	48	131K	text🧠 reasoning	Up to 40 RPM	Jul 25, 2025	197.8M	Online
ModelScope	LLM-Research/c4ai-command-r-plus-08-2024 Verified	47	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/openai/gpt-oss-120b Verified	47	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/nvidia/nemotron-3-120b-a12b Verified	47	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-large-en-v1.5 Verified	47	8K			Jun 29, 2026	—	Online
Glhf.chat	Mixtral 8x7B	47	33K	text	Unlimited for free models	Jun 27, 2026	—	Online
Mistral AI	Mistral 7B	47	33K	text	See provider page	Jun 27, 2026	—	Online
GitHub Models	Meta-Llama-3.3-70B Meta: Llama 3.3 70B Instruct (free)	47	131K	text	15 RPM, 150 RPD	Dec 6, 2024	—	Online
OVHcloud AI Endpoints	Meta-Llama-3_3-70B-Instruct Meta: Llama 3.3 70B Instruct (free)	47	131K	text	2 RPM (anonymous)	Dec 6, 2024	—	Online
LLM7.io	gpt-4o-mini gpt-4o-mini	47	131K	text👁️ imagepdf	30 RPM (120 with token)	Jul 18, 2024	—	Online
SiliconFlow	Abbreviation	46	131K	text	See provider page	May 10, 2026	—	Online
Groq	llama-3.3-70b-versatile Meta: Llama 3.3 70B Instruct (free)	46	131K	text	30 RPM, 1,000 RPD	Dec 6, 2024	—	Online
ModelScope	Qwen/Qwen3-235B-A22B-Thinking-2507 Verified Qwen/Qwen3-235B-A22B-Thinking-2507	46	8K	🧠 reasoning		Jul 25, 2025	—	Online
ModelScope	GLM-4.7-FlashX Verified	45	200K	🧠 reasoning		Jun 29, 2026	—	Online
Groq	GPT-OSS 20B Paid	45	131K	text	See provider page	Aug 5, 2025	26.0B	Online
NVIDIA NIM	mistralai/mistral-medium-3.5-128b Verified	45	8K			Jun 29, 2026	—	Online
OpenRouter	qwen/qwen3-coder Verified Qwen: Qwen3 Coder 480B A35B (free)	45	8K		200 req/day (free tier)	Jul 23, 2025	—	Online
Chutes.ai	DeepSeek-R1 DeepSeek-R1	45	131K	text🧠 reasoning	Community-powered, no hard cap	May 28, 2025	—	Online
Cloudflare Workers AI	@cf/mistralai/mistral-small-3.1-24b-instruct mistral-small-3.1-24b	45	128K	text	10K neurons/day (shared)	Mar 17, 2025	—	Online
Groq	qwen3-32b qwen3-32b	45	131K	text🧠 reasoning	30 RPM, 1,000 RPD	Apr 28, 2025	—	Online
Cohere	Command A (111B)	44	256K	text	20 RPM	May 10, 2026	—	Online
Hugging Face	Mixtral-8x7B-Instruct-v0.1	44	32K	text	Credit-metered	May 10, 2026	—	Online
NVIDIA NIM	Llama 3.3 Nemotron Super 49B v1 Verified	44	131K	🧠 reasoning		Jun 29, 2026	—	Online
SiliconFlow	deepseek-ai/DeepSeek-R1-Distill-Qwen-7B	44	131K	text🧠 reasoning	30 RPM, 60K TPM	May 10, 2026	—	Online
Hugging Face	Phi-3.5-mini-instruct	44	128K	text	Credit-metered	May 10, 2026	—	Online
Cohere	Command A+ (218B)	44	128K	text	20 RPM	Jun 17, 2026	—	Online
Cohere	Command R+	44	128K	text	20 RPM	May 10, 2026	—	Online
Cohere	Command R7B	44	128K	text	20 RPM	May 10, 2026	—	Online
ModelScope	GLM-5.2 Verified	44	1.0M	🧠 reasoning		Jun 29, 2026	—	Online
OpenCode Zen	Nemotron 3 Ultra 550B A55B Verified	44	1.0M	🧠 reasoning		Jun 28, 2026	—	Online
LLM7.io	Codestral (latest) Verified	44	256K	text		Jun 29, 2026	—	Online
Aion Labs	Aion 2.5	44	128K	text	15 RPM, 20K TPD	Jun 17, 2026	—	Online
OpenRouter	openai/gpt-oss-20b Verified	44	8K		200 req/day (free tier)	Jun 29, 2026	—	Online
OpenRouter	poolside/laguna-xs.2 Verified	44	8K		200 req/day (free tier)	Jun 29, 2026	—	Online
OpenRouter	poolside/laguna-m.1 Verified	44	8K		200 req/day (free tier)	Jun 29, 2026	—	Online
OpenRouter	OpenAI: gpt-oss-safeguard-20b Paid	44	131K	text	200 req/day (free tier)	Oct 29, 2025	6.0B	Online
OpenRouter	Nemotron 3 Nano 30B A3B Verified NVIDIA: Nemotron 3 Nano 30B A3B (free)	44	262K	🧠 reasoning	200 req/day (free tier)	Dec 14, 2025	—	Online
Google Gemini	Gemini 2.5 Flash Verified Gemini 2.5 Flash	44	1.0M	👁️ visionaudio🧠 reasoning		Jun 17, 2025	—	Online
Mistral AI	Mistral Nemo (12B) Mistral-Nemo-Instruct-2407	44	128K	text	~1 RPS, 500K TPM	Jul 1, 2024	—	Online
ModelScope	Qwen/Qwen3-Next-80B-A3B-Thinking Verified Qwen/Qwen3-Next-80B-A3B-Thinking	44	8K	🧠 reasoning		Sep 11, 2025	—	Online
OpenRouter	Meta: Llama 3.2 3B Instruct (free) Verified Meta: Llama 3.2 3B Instruct (free)	44	131K	text	200 req/day (free tier)	Sep 25, 2024	57.1M	Online
NVIDIA NIM	nvidia/llama-nemotron-embed-1b-v2 Verified nvidia/llama-nemotron-embed-1b-v2	44	131K	embeddingtext👁️ image	Up to 40 RPM	Feb 10, 2026	—	Online
NVIDIA NIM	nvidia/llama-nemotron-embed-vl-1b-v2 Verified nvidia/llama-nemotron-embed-1b-v2	44	131K	embeddingtext👁️ image	Up to 40 RPM	Feb 10, 2026	4.0B	Online
LLM7.io	qwen2.5-coder-32b qwen2.5-coder-32b	44	131K	textcode	30 RPM (120 with token)	Nov 11, 2024	—	Online
GitHub Models	Phi-4 Phi-4	44	131K	text	See provider page	Dec 12, 2024	—	Online
ModelScope	MiniMax-M3 Verified	43	512K	👁️ vision🧠 reasoning		Jun 29, 2026	—	Online
ModelScope	opencompass/CompassJudger-1-32B-Instruct Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	stepfun-ai/Step-3.5-Flash Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	stepfun-ai/Step-3.7-Flash Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	XGenerationLab/XiYanSQL-QwenCoder-32B-2412 Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	XGenerationLab/XiYanSQL-QwenCoder-32B-2504 Verified	43	8K			Jun 29, 2026	—	Online
Hugging Face	Mistral-7B-Instruct-v0.3	43	32K	text	Credit-metered	May 10, 2026	—	Online
ModelScope	Kimi K2.5 Verified	43	262K	👁️ vision🧠 reasoning		Jun 29, 2026	—	Online
NVIDIA NIM	Nemotron Mini 4B Instruct Verified	43	128K	text		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/zai-org/glm-4.7-flash Verified	43	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/qwen/qwq-32b Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	MiniMax-M2.5-highspeed Verified	43	205K	🧠 reasoning		Jun 29, 2026	—	Online
ModelScope	GLM-5.1 Verified	43	200K	🧠 reasoning		Jun 29, 2026	—	Online
ModelScope	GLM-5.1 Verified	43	200K	🧠 reasoning		Jun 29, 2026	—	Online
OpenRouter	qwen/qwen3-next-80b-a3b-instruct Verified Qwen: Qwen3 Next 80B A3B Instruct (free)	43	8K		200 req/day (free tier)	Sep 11, 2025	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.3-70b-instruct-fp8-fast Meta: Llama 3.3 70B Instruct (free)	43	131K	text	10K neurons/day (shared)	Dec 6, 2024	—	Online
NVIDIA NIM	meta/llama-3.1-70b-instruct Verified meta/llama-3.1-70b-instruct	43	131K	text	Up to 40 RPM	Jul 23, 2024	—	Online
ModelScope	Qwen/Qwen3-235B-A22B-Instruct-2507 Verified Qwen/Qwen3-235B-A22B-Instruct-2507	43	8K			Jul 21, 2025	—	Online
NVIDIA NIM	bytedance/seed-oss-36b-instruct Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	google/diffusiongemma-26b-a4b-it Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	google/gemma-2-2b-it Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	google/gemma-3n-e2b-it Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	meta/llama-3.2-90b-vision-instruct Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	meta/llama-4-maverick-17b-128e-instruct Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	mistralai/mistral-nemotron Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	mistralai/mistral-small-4-119b-2603 Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	mistralai/mixtral-8x7b-instruct-v0.1 Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/gliner-pii Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/ising-calibration-1-35b-a3b Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/riva-translate-4b-instruct-v1.1 Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	sarvamai/sarvam-m Verified	42	8K			Jun 29, 2026	—	Online
NVIDIA NIM	stockmark/stockmark-2-100b-instruct Verified	42	8K			Jun 29, 2026	—	Online
Google Gemini	Gemini Flash-Lite Latest Verified	42	1.0M	👁️ visionaudio🧠 reasoning		Jun 29, 2026	—	Online
OVHcloud AI Endpoints	Llama-3.1-8B-Instruct Llama-3.1-8B-Instruct	42	131K	text	2 RPM (anonymous)	Jul 23, 2024	—	Online
Z AI (Zhipu AI)	GLM-4.5-Air Verified GLM-4.5-Air	42	131K	🧠 reasoning		Jul 25, 2025	—	Online
ModelScope	Qwen/Qwen3-VL-235B-A22B-Instruct Verified Qwen/Qwen3-VL-235B-A22B-Instruct	42	8K			Sep 23, 2025	—	Online
Aion Labs	Aion-RP 1.0 (8B)	41	32K	text	15 RPM, 20K TPD	Jun 17, 2026	—	Online
LLM7.io	devstral-small-2:24b Verified	41	8K			Jun 29, 2026	—	Online
NVIDIA NIM	microsoft/phi-4-mini-instruct Verified	41	8K			Jun 29, 2026	—	Online
NVIDIA NIM	upstage/solar-10.7b-instruct Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen3-Coder-30B-A3B-Instruct Verified Qwen3-Coder-30B-A3B-Instruct	41	8K			Jul 31, 2025	—	Online
NVIDIA NIM	Nemotron Nano 12B v2 VL Verified NVIDIA: Nemotron Nano 12B 2 VL (free)	41	128K	👁️ vision🧠 reasoning		Oct 28, 2025	—	Online
Groq	llama-3.1-8b-instant Llama-3.1-8B-Instruct	41	131K	text	30 RPM, 1,000 RPD	Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/deepseek-ai/deepseek-r1-distill-qwen-32b @cf/deepseek-ai/deepseek-r1-distill-qwen-32b	41	32K	text🧠 reasoning	10K neurons/day (shared)	Jan 20, 2025	—	Online
NVIDIA NIM	meta/llama-3.2-11b-vision-instruct Verified meta/llama-3.2-11b-vision-instruct	41	131K	text👁️ image	Up to 40 RPM	Sep 25, 2024	—	Online
ModelScope	LLM-Research/Llama-4-Maverick-17B-128E-Instruct Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	MedAIBase/AntAngelMed Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	MiniMax/MiniMax-M1-80k Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	mistralai/Mistral-Small-Instruct-2409 Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	MusePublic/Qwen-Image-Edit Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	OpenGVLab/InternVL3_5-241B-A28B Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-21B-A3B-PT Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-300B-A47B-PT Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-VL-28B-A3B-PT Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen-Image-Edit Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen3-4B Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	Shanghai_AI_Laboratory/Intern-S1 Verified	40	8K			Jun 29, 2026	—	Online
ModelScope	Shanghai_AI_Laboratory/Intern-S2-Preview Verified	40	8K			Jun 29, 2026	—	Online
Grok (xAI)	Grok-2 Mini	40	131K	text	$25/month free credits, resets monthly	Jun 27, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-m3 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/google/gemma-2b-it-lora Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/moonshotai/kimi-k2.7-code Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/moonshotai/kimi-k2.6 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/ibm-granite/granite-4.0-h-micro Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-small-en-v1.5 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/zai-org/glm-5.2 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-base-en-v1.5 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/aisingapore/gemma-sea-lion-v4-27b-it Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/openai/gpt-oss-20b Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/meta/llama-4-scout-17b-16e-instruct Verified	40	8K			Jun 29, 2026	—	Online
Google Gemini	gemini-robotics-er-1.6-preview Verified	40	8K			Jun 29, 2026	—	Online
OpenCode Zen	DeepSeek V4 Flash Verified	40	1.0M	🧠 reasoning		Jun 28, 2026	—	Online
ModelScope	Qwen/Qwen3-Next-80B-A3B-Instruct Verified Qwen: Qwen3 Next 80B A3B Instruct (free)	40	8K			Sep 11, 2025	—	Online
Google Gemini	Gemini 2.5 Flash-Lite Verified gemini-2.5-flash-lite	40	1.0M	👁️ visionaudio🧠 reasoning		Jul 22, 2025	—	Online
NVIDIA NIM	meta/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free)	40	131K	text	Up to 40 RPM	Sep 25, 2024	—	Online
Chutes.ai	Llama 3.1 70B meta/llama-3.1-70b-instruct	40	131K	text	Community-powered, no hard cap	Jul 23, 2024	—	Online
ModelScope	Qwen/Qwen3-30B-A3B-Thinking-2507 Verified Qwen/Qwen3-30B-A3B-Thinking-2507	40	8K	🧠 reasoning		Aug 28, 2025	—	Online
Hugging Face	Qwen2.5-7B-Instruct Qwen2.5-7B-Instruct	40	131K	text	Credit-metered	Oct 16, 2024	—	Online
Mistral AI	Pixtral Large Pixtral Large	40	128K	text👁️ image	~1 RPS, 500K TPM	Nov 18, 2024	—	Online
NVIDIA NIM	nvidia/embed-qa-4 Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1 Verified	39	131K	embeddingrerank	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.2-nv-embedqa-1b-v1 Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nv-embed-v1 Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nv-embedcode-7b-v1 Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nv-embedqa-e5-v5 Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nv-embedqa-mistral-7b-v2 Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	snowflake/arctic-embed-l Verified	39	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
ModelScope	meituan-longcat/LongCat-Flash-Lite Verified	39	8K			Jun 29, 2026	—	Online
ModelScope	mistralai/Ministral-8B-Instruct-2410 Verified	39	8K			Jun 29, 2026	—	Online
ModelScope	Shanghai_AI_Laboratory/Intern-S1-mini Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/mistral/mistral-7b-instruct-v0.2-lora Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/meta-llama/llama-2-7b-chat-hf-lora Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/google/gemma-7b-it-lora Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/qwen/qwen2.5-coder-32b-instruct Verified qwen2.5-coder-32b	39	8K			Nov 11, 2024	—	Online
Glhf.chat	Llama 3.1 70B meta/llama-3.1-70b-instruct	39	131K	text	Unlimited for free models	Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/deepseek-ai/deepseek-r1-distill-qwen-32b Verified @cf/deepseek-ai/deepseek-r1-distill-qwen-32b	39	8K	🧠 reasoning		Jan 29, 2025	—	Online
GitHub Models	Mistral Large (24.11) Mistral Large (24.11)	39	131K	text👁️ image	See provider page	Feb 26, 2024	—	Online
NVIDIA NIM	meta/llama-3.2-1b-instruct Verified meta/llama-3.2-1b-instruct	39	131K	text	Up to 40 RPM	Sep 25, 2024	—	Online
ModelScope	Qwen/Qwen3-VL-8B-Thinking Verified Qwen/Qwen3-VL-8B-Thinking	39	8K	🧠 reasoning		Oct 14, 2025	—	Online
NVIDIA NIM	mistralai/ministral-14b-instruct-2512 Verified mistralai/ministral-14b-instruct-2512	39	8K			Dec 2, 2025	—	Online
ModelScope	Qwen/Qwen3-235B-A22B Verified Qwen/Qwen3-235B-A22B	39	8K			Apr 28, 2025	—	Online
ModelScope	Qwen/Qwen3-VL-8B-Instruct Verified Qwen/Qwen3-VL-8B-Instruct	38	8K			Oct 14, 2025	—	Online
Groq	GPT-OSS Safeguard 20B Paid	37	131K	text	See provider page	Jun 27, 2026	6.0B	Online
OpenCode Zen	MiMo-V2.5 Verified	37	1.0M	👁️ visionaudio🧠 reasoning		Jun 28, 2026	—	Online
NVIDIA NIM	Llama-3.3-70B-Instruct Verified Meta: Llama 3.3 70B Instruct (free)	37	128K	text		Dec 6, 2024	—	Online
OpenRouter	meta-llama/llama-3.3-70b-instruct Verified Meta: Llama 3.3 70B Instruct (free)	37	8K		200 req/day (free tier)	Dec 6, 2024	—	Online
Cloudflare Workers AI	@cf/mistralai/mistral-small-3.1-24b-instruct Verified mistral-small-3.1-24b	36	8K			Mar 17, 2025	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemotron-nano-vl-8b-v1 Verified	35	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/nvidia-nemotron-nano-9b-v2 Verified	35	8K			Jun 29, 2026	—	Online
NVIDIA NIM	meta/llama-guard-4-12b Verified meta/llama-guard-4-12b	35	164K	text👁️ image	Up to 40 RPM	Apr 30, 2025	—	Online
ModelScope	mistralai/Mistral-Large-Instruct-2407 Verified mistralai/Mistral-Large-Instruct-2407	35	8K			Nov 19, 2024	—	Online
ModelScope	Qwen/Qwen3-30B-A3B Verified Qwen/Qwen3-30B-A3B	35	8K			Apr 28, 2025	—	Online
OpenCode Zen	North Mini Code Verified	34	256K	🧠 reasoning		Jun 28, 2026	—	Online
Hugging Face	Meta-Llama-3.1-8B-Instruct Llama-3.1-8B-Instruct	34	128K	text	Credit-metered	Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/qwen/qwen3-30b-a3b-fp8 Verified Qwen/Qwen3-30B-A3B	34	8K			Apr 28, 2025	—	Online
ModelScope	Qwen/Qwen3-8B Verified Qwen/Qwen3-8B	34	8K			Apr 28, 2025	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-0.3B-PT Verified	33	8K			Jun 29, 2026	—	Online
OpenCode Zen	big-pickle Verified	33	N/A			Jun 28, 2026	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.3-70b-instruct-fp8-fast Verified Meta: Llama 3.3 70B Instruct (free)	33	8K			Dec 6, 2024	—	Online
OpenRouter	meta-llama/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free)	33	8K		200 req/day (free tier)	Sep 25, 2024	—	Online
ModelScope	Qwen/Qwen3-14B Verified Qwen/Qwen3-14B	33	8K			Apr 28, 2025	—	Online
Grok (xAI)	Grok-2	32	131K	text	$25/month free credits, resets monthly	Dec 12, 2024	—	Online
NVIDIA NIM	Llama 3.1 Nemotron Safety Guard 8B v3 Verified	32	128K	text		Jun 29, 2026	—	Online
NVIDIA NIM	Nemotron 3 Content Safety Verified	32	128K	text		Jun 29, 2026	—	Online
NVIDIA NIM	Nemotron Content Safety Reasoning 4B Verified	32	128K	🧠 reasoning		Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen3-32B Verified qwen3-32b	32	8K			Apr 28, 2025	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemoguard-8b-content-safety Verified	30	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemoguard-8b-topic-control Verified	30	8K			Jun 29, 2026	—	Online
NVIDIA NIM	meta/llama-3.1-8b-instruct Verified Llama-3.1-8B-Instruct	30	8K			Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free)	29	8K			Sep 25, 2024	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.1-8b-instruct-fp8 Verified Llama-3.1-8B-Instruct	28	8K			Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/meta/llama-guard-3-8b Verified	27	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/qwen/qwen3-embedding-0.6b Verified	27	8K	embedding		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/pfnet/plamo-embedding-1b Verified	27	8K	embedding		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/google/embeddinggemma-300m Verified	27	8K	embedding		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.2-1b-instruct Verified meta/llama-3.2-1b-instruct	27	8K			Sep 25, 2024	—	Online

How to Get Started with Free LLM APIs

Pick a free LLM model — Click any model name to see details, rate limits, and API key signup link.
Get your API key — Sign up on the provider's website (most require no credit card).
Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
Test it — Use the Playground to test your API key before integrating.

New to LLM terminology? Check the 📖 Glossary — 22 terms explained in plain English →

FAQ: Common questions about free LLM APIs →

About This Free LLM API Directory

Finding reliable free LLM API resources online can be frustrating. Many developers traditionally rely on static GitHub repositories to find endpoints. While those lists are a good starting point, they often become outdated quickly, leaving you with dead links, expired API keys, and unverified rate limits.

That's why we built this dynamic, auto-updating directory. If you are looking for a reliable alternative to GitHub free LLM API lists, this page tracks over 312 free LLM models online in real-time. Whether you need a free API key for text generation, vision, or coding tasks, you can compare context windows, capabilities, and strict rate limit data side-by-side.

Our goal is to be the most accurate and comprehensive list of free AI APIs for developers. Use the filters above to find providers that don't require credit cards or phone verification, and grab your free API keys to start building immediately.

Directory of Free LLM APIs: Compare 312+ Models

How to Get Started with Free LLM APIs

About This Free LLM API Directory