Directory of Free LLM APIs: Compare 312+ Models
Showing 312 of 312 free LLM models
Discover and filter 312+ free LLM models across 30 providers. Find APIs by capability (vision, reasoning), rate limits, or no-credit-card requirements, and get the perfect free AI model for your project.
| Provider | Model | Score | Context | Modality | Rate Limit | Status |
|---|---|---|---|---|---|---|
| MiniMax: MiniMax M3 Paid MiniMax: MiniMax M3 | 93 | 1.0M | 200 req/day (free tier) | Online | ||
| Nex AGI: Nex-N2-Pro Paid Nex AGI: Nex-N2-Pro | 91 | 262K | 200 req/day (free tier) | Online | ||
| Gemini 3.5 Flash | 90 | 1.0M | 15 RPM, 1,500 RPD | Online | ||
| minimaxai/minimax-m3 Verified MiniMax: MiniMax M3 | 89 | 1.0M | Up to 40 RPM | Online | ||
| DeepSeek: DeepSeek V4 Flash | 88 | 1.0M | 200 req/day (free tier) | Online | ||
| NVIDIA: Nemotron 3 Ultra (free) Verified NVIDIA: Nemotron 3 Ultra (free) | 88 | 1.0M | 200 req/day (free tier) | Online | ||
| MoonshotAI: Kimi K2.6 | 87 | 262K | 200 req/day (free tier) | Online | ||
| deepseek-ai/deepseek-v4-pro Verified deepseek-ai/deepseek-v4-pro | 86 | 1.0M | Up to 40 RPM | Online | ||
| Z.ai: GLM 5.1 Paid | 85 | 203K | 200 req/day (free tier) | Online | ||
| deepseek-ai/deepseek-v4-flash Verified DeepSeek: DeepSeek V4 Flash | 83 | 1.0M | Up to 40 RPM | Online | ||
| moonshotai/kimi-k2.6 Verified MoonshotAI: Kimi K2.6 | 82 | 262K | Up to 40 RPM | Online | ||
| agnes-2.0-flash Verified | 81 | 256K | 30 RPM | Online | ||
| Qwen3.6-27B | 81 | 131K | 2 RPM (anonymous) | Online | ||
| z-ai/glm-5.1 Verified | 80 | 203K | Up to 40 RPM | Online | ||
| 80 | 262K | 200 req/day (free tier) | Online | |||
| Cohere: North Mini Code (free) Verified Cohere: North Mini Code (free) | 79 | 256K | 200 req/day (free tier) | Online | ||
| Nemotron 3 Ultra 550B A55B Verified NVIDIA: Nemotron 3 Ultra (free) | 77 | 1.0M | 200 req/day (free tier) | Online | ||
| minimaxai/minimax-m2.7 Verified MiniMax: MiniMax M3 | 76 | 205K | Up to 40 RPM | Online | ||
| stepfun-ai/step-3.7-flash Verified | 76 | 256K | Up to 40 RPM | Online | ||
| Qwen/Qwen3.5-27B | 76 | 131K | 2,000 RPD total; <=500 RPD/model (dynamic) | Online | ||
| Google: Gemma 4 31B (free) Verified Google: Gemma 4 31B (free) | 76 | 262K | 200 req/day (free tier) | Online | ||
| MiniMax: MiniMax M3 | 75 | 205K | 200 req/day (free tier) | Online | ||
| deepseek-ai/DeepSeek-V4-Pro Verified deepseek-ai/deepseek-v4-pro | 73 | 8K | Online | |||
| Google: Gemma 4 26B A4B (free) Verified Google: Gemma 4 26B A4B (free) | 73 | 262K | 200 req/day (free tier) | Online | ||
| Qwen3.5-9B | 73 | 131K | 2 RPM (anonymous) | Online | ||
| MiniMax-M2.7 | 73 | 128K | 20 RPM, 20 RPD, 200K TPD | Online | ||
| MiniMax: MiniMax M3 | 72 | 196K | ~200 req/hr | Online | ||
| 72 | 262K | 200 req/day (free tier) | Online | |||
| Qwen3.5-397B-A17B | 72 | 131K | 2 RPM (anonymous) | Online | ||
| qwen/qwen3.5-397b-a17b Verified Qwen3.5-397B-A17B | 71 | 256K | Up to 40 RPM | Online | ||
| Qwen/Qwen3.5-35B-A3B | 71 | 131K | 2,000 RPD total; <=500 RPD/model (dynamic) | Online | ||
| NVIDIA: Nemotron 3 Nano Omni (free) Verified NVIDIA: Nemotron 3 Nano Omni (free) | 71 | 256K | 200 req/day (free tier) | Online | ||
| NVIDIA: Nemotron 3 Super (free) Verified NVIDIA: Nemotron 3 Super (free) | 71 | 1.0M | 200 req/day (free tier) | Online | ||
| qwen/qwen3.5-122b-a10b Verified qwen/qwen3.5-122b-a10b | 71 | 262K | Up to 40 RPM | Online | ||
| deepseek-ai/DeepSeek-V4-Flash Verified DeepSeek: DeepSeek V4 Flash | 70 | 8K | Online | |||
| 69 | 131K | ~200 req/hr | Online | |||
| Poolside: Laguna XS.2 (free) Verified | 68 | 262K | 200 req/day (free tier) | Online | ||
| Poolside: Laguna M.1 (free) Verified | 68 | 262K | 200 req/day (free tier) | Online | ||
| NVIDIA: Nemotron 3 Super (free) | 68 | 262K | ~200 req/hr | Online | ||
| Qwen/Qwen3.5-27B Verified Qwen/Qwen3.5-27B | 67 | 8K | Online | |||
| Google: Gemma 4 26B A4B (free) | 66 | 256K | 10K neurons/day (shared) | Online | ||
| 65 | 128K | Session/weekly limits (unpublished) | Online | |||
| Gemma 4 31B IT Verified Google: Gemma 4 31B (free) | 65 | 262K | 200 req/day (free tier) | Online | ||
| MoonshotAI: Kimi K2.6 | 64 | 262K | 10K neurons/day (shared) | Online | ||
| stepfun-ai/step-3.5-flash Verified | 64 | 262K | Up to 40 RPM | Online | ||
| GLM-4.7-Flash | 64 | 200K | 1 concurrent request | Online | ||
| Gemma 4 26B A4B IT Verified Google: Gemma 4 26B A4B (free) | 63 | 262K | 200 req/day (free tier) | Online | ||
| GLM-4.6V-Flash | 63 | 128K | 1 concurrent request | Online | ||
| Qwen/Qwen3.5-35B-A3B Verified Qwen/Qwen3.5-35B-A3B | 62 | 8K | Online | |||
| Gemini 3.1 Flash-Lite | 62 | 1.0M | 30 RPM, 1,500 RPD | Online | ||
| o4-mini | 62 | 200K | 10 RPM, 50 RPD | Online | ||
| 61 | 256K | ~1 RPS, 500K TPM | Online | |||
| 61 | 131K | ~200 req/hr | Online | |||
| Gemma 4 31B IT Verified Google: Gemma 4 31B (free) | 61 | 262K | Online | |||
| deepseek-ai/DeepSeek-V3.2 Verified deepseek-ai/DeepSeek-V3.2 | 61 | 8K | Online | |||
| deepseek-r1:cloud | 61 | 128K | Session/weekly limits (unpublished) | Online | ||
| 60 | 128K | Session/weekly limits (unpublished) | Online | |||
| Nemotron 3 Super 120B A12B Verified NVIDIA: Nemotron 3 Super (free) | 60 | 262K | 200 req/day (free tier) | Online | ||
| zai-glm-4.7 | 60 | 128K | 10 RPM, 100 RPD, 1M TPD | Online | ||
| 59 | 128K | 20 RPM, 20 RPD, 200K TPD | Online | |||
| Nous: Hermes 3 405B Instruct (free) Verified | 59 | 131K | 200 req/day (free tier) | Online | ||
| 59 | 256K | See provider page | Online | |||
| Gemma 4 26B A4B IT Verified Google: Gemma 4 26B A4B (free) | 59 | 262K | Online | |||
| Qwen/Qwen3.5-397B-A17B Verified Qwen3.5-397B-A17B | 59 | 8K | Online | |||
| NVIDIA: Nemotron 3 Nano Omni (free) | 59 | 256K | Online | |||
| Qwen/Qwen3.5-122B-A10B Verified qwen/qwen3.5-122b-a10b | 59 | 8K | Online | |||
| Qwen: Qwen3 Coder 480B A35B (free) Verified Qwen: Qwen3 Coder 480B A35B (free) | 59 | 1.0M | 200 req/day (free tier) | Online | ||
| gpt-4.1 | 58 | 1.0M | 10 RPM, 50 RPD | Online | ||
| OpenAI: gpt-oss-120b (free) Verified OpenAI: gpt-oss-120b (free) | 58 | 131K | 200 req/day (free tier) | Online | ||
| NVIDIA: Nemotron 3.5 Content Safety (free) | 58 | 128K | 200 req/day (free tier) | Online | ||
| gemma-4-31B-it (Preview) | 57 | 128K | 20 RPM, 20 RPD, 200K TPD | Online | ||
| 56 | 128K | 10K neurons/day (shared) | Online | |||
| 56 | 256K | ~200 req/hr | Online | |||
| @cf/google/gemma-4-26b-a4b-it Verified Google: Gemma 4 26B A4B (free) | 56 | 8K | Online | |||
| gpt-4.1-mini | 56 | 1.0M | 15 RPM, 150 RPD | Online | ||
| deepseek-r1-0528 | 56 | 131K | 30 RPM (120 with token) | Online | ||
| Qwen: Qwen3 Next 80B A3B Instruct (free) | 56 | 262K | 200 req/day (free tier) | Online | ||
| gpt-5 | 56 | 200K | 10 RPM, 50 RPD | Online | ||
| @cf/zhipuai/glm-4.7-flash | 56 | 131K | 10K neurons/day (shared) | Online | ||
| gpt-oss:120b-cloud | 56 | 128K | Session/weekly limits (unpublished) | Online | ||
| Google: Lyria 3 Pro Preview Verified Google: Lyria 3 Pro Preview | 56 | 1.0M | 200 req/day (free tier) | Online | ||
| Google: Lyria 3 Clip Preview Verified Google: Lyria 3 Clip Preview | 56 | 1.0M | 200 req/day (free tier) | Online | ||
| agnes-1.5-flash Verified | 55 | 256K | 30 RPM | Online | ||
| Qwen3-Coder-30B-A3B-Instruct | 55 | 262K | 2 RPM (anonymous) | Online | ||
| 54 | 262K | Session/weekly limits (unpublished) | Online | |||
| Gemini 3.1 Flash Lite Verified Gemini 3.1 Flash-Lite | 54 | 1.0M | Online | |||
| Gemini 3.1 Flash Lite Verified Gemini 3.1 Flash-Lite | 54 | 1.0M | Online | |||
| gpt-oss:120b-cloud | 54 | 128K | 30 RPM, 14,400 RPD, 1M TPD | Online | ||
| NVIDIA: Nemotron 3 Nano 30B A3B (free) | 54 | 256K | 200 req/day (free tier) | Online | ||
| Z.ai: GLM 4.5 Air Paid Z.ai: GLM 4.5 Air | 54 | 131K | 200 req/day (free tier) | Online | ||
| 53 | 8K | Online | ||||
| GPT OSS 120B Verified | 53 | 131K | 200 req/day (free tier) | Online | ||
| nvidia/nemotron-3.5-content-safety Verified NVIDIA: Nemotron 3.5 Content Safety (free) | 53 | 128K | Up to 40 RPM | Online | ||
| NVIDIA: Nemotron Nano 12B 2 VL (free) | 53 | 128K | 200 req/day (free tier) | Online | ||
| DeepSeek-V3.1 | 53 | 128K | 20 RPM, 20 RPD, 200K TPD | Online | ||
| 52 | 256K | ~1 RPS, 500K TPM | Online | |||
| GPT-OSS 120B Paid | 52 | 131K | See provider page | Online | ||
| nvidia/nemoretriever-parse Verified | 52 | 131K | Up to 40 RPM | Online | ||
| OpenAI: gpt-oss-20b (free) Verified | 52 | 131K | 200 req/day (free tier) | Online | ||
| agnes-image-2.0-flash Verified | 52 | 4K | 30 RPM (1K) | Online | ||
| agnes-image-2.1-flash Verified | 52 | 4K | 30 RPM (1K) | Online | ||
| zai-glm-4.7 | 52 | 128K | Session/weekly limits (unpublished) | Online | ||
| Gemini 2.5 Flash | 52 | 1.0M | 15 RPM, 1,500 RPD | Online | ||
| Venice: Uncensored (free) Verified | 51 | 33K | 200 req/day (free tier) | Online | ||
| 51 | 10.0M | 10K neurons/day (shared) | Online | |||
| nousresearch/hermes-3-llama-3.1-405b Verified | 51 | 8K | 200 req/day (free tier) | Online | ||
| Gemini 2.5 Pro | 51 | 2.0M | 5 RPM, 50 RPD | Online | ||
| NVIDIA: Nemotron Nano 9B V2 (free) Verified NVIDIA: Nemotron Nano 9B V2 (free) | 51 | 128K | 200 req/day (free tier) | Online | ||
| Llama-4-Scout-17B-16E | 51 | 512K | 15 RPM, 150 RPD | Online | ||
| Llama-4-Scout-17B-16E | 51 | 256K | 10 RPM, 50 RPD | Online | ||
| gpt-4o | 51 | 128K | 10 RPM, 50 RPD | Online | ||
| 50 | 128K | 15 RPM, 150 RPD | Online | |||
| 50 | 33K | 200 req/day (free tier) | Online | |||
| gemini-2.5-flash-lite | 50 | 131K | 30 RPM (120 with token) | Online | ||
| Aion 2.0 | 50 | 128K | 15 RPM, 20K TPD | Online | ||
| Qwen2.5-VL-72B-Instruct | 50 | 128K | 2 RPM (anonymous) | Online | ||
| Mistral-Small-3.2-24B-Instruct | 50 | 128K | 2 RPM (anonymous) | Online | ||
| Mistral-Nemo-Instruct-2407 | 50 | 128K | 2 RPM (anonymous) | Online | ||
| DeepSeek-R1 | 50 | 64K | 15 RPM, 150 RPD | Online | ||
| 49 | 256K | ~1 RPS, 500K TPM | Online | |||
| 49 | 256K | ~1 RPS, 500K TPM | Online | |||
| 49 | 8K | Online | ||||
| agnes-video-v2.0 Verified | 49 | 4K | 2 RPM | Online | ||
| Llama-4-Scout-17B-16E | 49 | 131K | 30 RPM, 1,000 RPD | Online | ||
| Meta: Llama 3.3 70B Instruct (free) Verified Meta: Llama 3.3 70B Instruct (free) | 49 | 131K | 200 req/day (free tier) | Online | ||
| gpt-oss-20b | 49 | 128K | 2 RPM (anonymous) | Online | ||
| mistral-small-3.1-24b | 49 | 32K | 30 RPM (120 with token) | Online | ||
| 48 | 33K | 200 req/day (free tier) | Online | |||
| Free Models Router Verified | 48 | 200K | 200 req/day (free tier) | Online | ||
| 48 | 33K | See provider page | Online | |||
| deepseek-v3-0324 | 48 | 131K | 30 RPM (120 with token) | Online | ||
| nvidia/llama-3.3-nemotron-super-49b-v1.5 | 48 | 131K | Up to 40 RPM | Online | ||
| 47 | 8K | Online | ||||
| @cf/openai/gpt-oss-120b Verified | 47 | 8K | Online | |||
| @cf/nvidia/nemotron-3-120b-a12b Verified | 47 | 8K | Online | |||
| @cf/baai/bge-large-en-v1.5 Verified | 47 | 8K | Online | |||
| 47 | 33K | Unlimited for free models | Online | |||
| 47 | 33K | See provider page | Online | |||
| Meta: Llama 3.3 70B Instruct (free) | 47 | 131K | 15 RPM, 150 RPD | Online | ||
| Meta: Llama 3.3 70B Instruct (free) | 47 | 131K | 2 RPM (anonymous) | Online | ||
| gpt-4o-mini | 47 | 131K | 30 RPM (120 with token) | Online | ||
| 46 | 131K | See provider page | Online | |||
| Meta: Llama 3.3 70B Instruct (free) | 46 | 131K | 30 RPM, 1,000 RPD | Online | ||
| Qwen/Qwen3-235B-A22B-Thinking-2507 Verified Qwen/Qwen3-235B-A22B-Thinking-2507 | 46 | 8K | Online | |||
| GLM-4.7-FlashX Verified | 45 | 200K | Online | |||
| GPT-OSS 20B Paid | 45 | 131K | See provider page | Online | ||
| mistralai/mistral-medium-3.5-128b Verified | 45 | 8K | Online | |||
| qwen/qwen3-coder Verified Qwen: Qwen3 Coder 480B A35B (free) | 45 | 8K | 200 req/day (free tier) | Online | ||
| DeepSeek-R1 | 45 | 131K | Community-powered, no hard cap | Online | ||
| mistral-small-3.1-24b | 45 | 128K | 10K neurons/day (shared) | Online | ||
| qwen3-32b | 45 | 131K | 30 RPM, 1,000 RPD | Online | ||
| 44 | 256K | 20 RPM | Online | |||
| 44 | 32K | Credit-metered | Online | |||
| Llama 3.3 Nemotron Super 49B v1 Verified | 44 | 131K | Online | |||
| 44 | 131K | 30 RPM, 60K TPM | Online | |||
| 44 | 128K | Credit-metered | Online | |||
| 44 | 128K | 20 RPM | Online | |||
| 44 | 128K | 20 RPM | Online | |||
| 44 | 128K | 20 RPM | Online | |||
| GLM-5.2 Verified | 44 | 1.0M | Online | |||
| Nemotron 3 Ultra 550B A55B Verified | 44 | 1.0M | Online | |||
| Codestral (latest) Verified | 44 | 256K | Online | |||
| 44 | 128K | 15 RPM, 20K TPD | Online | |||
| openai/gpt-oss-20b Verified | 44 | 8K | 200 req/day (free tier) | Online | ||
| poolside/laguna-xs.2 Verified | 44 | 8K | 200 req/day (free tier) | Online | ||
| poolside/laguna-m.1 Verified | 44 | 8K | 200 req/day (free tier) | Online | ||
| 44 | 131K | 200 req/day (free tier) | Online | |||
| Nemotron 3 Nano 30B A3B Verified NVIDIA: Nemotron 3 Nano 30B A3B (free) | 44 | 262K | 200 req/day (free tier) | Online | ||
| Gemini 2.5 Flash Verified Gemini 2.5 Flash | 44 | 1.0M | Online | |||
| Mistral-Nemo-Instruct-2407 | 44 | 128K | ~1 RPS, 500K TPM | Online | ||
| Qwen/Qwen3-Next-80B-A3B-Thinking Verified Qwen/Qwen3-Next-80B-A3B-Thinking | 44 | 8K | Online | |||
| Meta: Llama 3.2 3B Instruct (free) Verified Meta: Llama 3.2 3B Instruct (free) | 44 | 131K | 200 req/day (free tier) | Online | ||
| nvidia/llama-nemotron-embed-1b-v2 Verified nvidia/llama-nemotron-embed-1b-v2 | 44 | 131K | Up to 40 RPM | Online | ||
| nvidia/llama-nemotron-embed-vl-1b-v2 Verified nvidia/llama-nemotron-embed-1b-v2 | 44 | 131K | Up to 40 RPM | Online | ||
| qwen2.5-coder-32b | 44 | 131K | 30 RPM (120 with token) | Online | ||
| Phi-4 | 44 | 131K | See provider page | Online | ||
| MiniMax-M3 Verified | 43 | 512K | Online | |||
| 43 | 8K | Online | ||||
| stepfun-ai/Step-3.5-Flash Verified | 43 | 8K | Online | |||
| stepfun-ai/Step-3.7-Flash Verified | 43 | 8K | Online | |||
| 43 | 8K | Online | ||||
| 43 | 8K | Online | ||||
| 43 | 32K | Credit-metered | Online | |||
| Kimi K2.5 Verified | 43 | 262K | Online | |||
| Nemotron Mini 4B Instruct Verified | 43 | 128K | Online | |||
| @cf/zai-org/glm-4.7-flash Verified | 43 | 8K | Online | |||
| @cf/qwen/qwq-32b Verified | 43 | 8K | Online | |||
| MiniMax-M2.5-highspeed Verified | 43 | 205K | Online | |||
| GLM-5.1 Verified | 43 | 200K | Online | |||
| GLM-5.1 Verified | 43 | 200K | Online | |||
| qwen/qwen3-next-80b-a3b-instruct Verified Qwen: Qwen3 Next 80B A3B Instruct (free) | 43 | 8K | 200 req/day (free tier) | Online | ||
| Meta: Llama 3.3 70B Instruct (free) | 43 | 131K | 10K neurons/day (shared) | Online | ||
| meta/llama-3.1-70b-instruct Verified meta/llama-3.1-70b-instruct | 43 | 131K | Up to 40 RPM | Online | ||
| Qwen/Qwen3-235B-A22B-Instruct-2507 Verified Qwen/Qwen3-235B-A22B-Instruct-2507 | 43 | 8K | Online | |||
| bytedance/seed-oss-36b-instruct Verified | 42 | 8K | Online | |||
| google/diffusiongemma-26b-a4b-it Verified | 42 | 8K | Online | |||
| google/gemma-2-2b-it Verified | 42 | 8K | Online | |||
| google/gemma-3n-e2b-it Verified | 42 | 8K | Online | |||
| meta/llama-3.2-90b-vision-instruct Verified | 42 | 8K | Online | |||
| 42 | 8K | Online | ||||
| mistralai/mistral-nemotron Verified | 42 | 8K | Online | |||
| mistralai/mistral-small-4-119b-2603 Verified | 42 | 8K | Online | |||
| mistralai/mixtral-8x7b-instruct-v0.1 Verified | 42 | 8K | Online | |||
| nvidia/gliner-pii Verified | 42 | 8K | Online | |||
| nvidia/ising-calibration-1-35b-a3b Verified | 42 | 8K | Online | |||
| 42 | 8K | Online | ||||
| sarvamai/sarvam-m Verified | 42 | 8K | Online | |||
| stockmark/stockmark-2-100b-instruct Verified | 42 | 8K | Online | |||
| Gemini Flash-Lite Latest Verified | 42 | 1.0M | Online | |||
| Llama-3.1-8B-Instruct | 42 | 131K | 2 RPM (anonymous) | Online | ||
| GLM-4.5-Air Verified GLM-4.5-Air | 42 | 131K | Online | |||
| Qwen/Qwen3-VL-235B-A22B-Instruct Verified Qwen/Qwen3-VL-235B-A22B-Instruct | 42 | 8K | Online | |||
| 41 | 32K | 15 RPM, 20K TPD | Online | |||
| devstral-small-2:24b Verified | 41 | 8K | Online | |||
| microsoft/phi-4-mini-instruct Verified | 41 | 8K | Online | |||
| upstage/solar-10.7b-instruct Verified | 41 | 8K | Online | |||
| Qwen/Qwen3-Coder-30B-A3B-Instruct Verified Qwen3-Coder-30B-A3B-Instruct | 41 | 8K | Online | |||
| Nemotron Nano 12B v2 VL Verified NVIDIA: Nemotron Nano 12B 2 VL (free) | 41 | 128K | Online | |||
| Llama-3.1-8B-Instruct | 41 | 131K | 30 RPM, 1,000 RPD | Online | ||
| @cf/deepseek-ai/deepseek-r1-distill-qwen-32b | 41 | 32K | 10K neurons/day (shared) | Online | ||
| meta/llama-3.2-11b-vision-instruct Verified meta/llama-3.2-11b-vision-instruct | 41 | 131K | Up to 40 RPM | Online | ||
| 40 | 8K | Online | ||||
| MedAIBase/AntAngelMed Verified | 40 | 8K | Online | |||
| MiniMax/MiniMax-M1-80k Verified | 40 | 8K | Online | |||
| 40 | 8K | Online | ||||
| MusePublic/Qwen-Image-Edit Verified | 40 | 8K | Online | |||
| OpenGVLab/InternVL3_5-241B-A28B Verified | 40 | 8K | Online | |||
| PaddlePaddle/ERNIE-4.5-21B-A3B-PT Verified | 40 | 8K | Online | |||
| PaddlePaddle/ERNIE-4.5-300B-A47B-PT Verified | 40 | 8K | Online | |||
| PaddlePaddle/ERNIE-4.5-VL-28B-A3B-PT Verified | 40 | 8K | Online | |||
| Qwen/Qwen-Image-Edit Verified | 40 | 8K | Online | |||
| Qwen/Qwen3-4B Verified | 40 | 8K | Online | |||
| Shanghai_AI_Laboratory/Intern-S1 Verified | 40 | 8K | Online | |||
| 40 | 8K | Online | ||||
| 40 | 131K | $25/month free credits, resets monthly | Online | |||
| @cf/baai/bge-m3 Verified | 40 | 8K | Online | |||
| @cf/google/gemma-2b-it-lora Verified | 40 | 8K | Online | |||
| @cf/moonshotai/kimi-k2.7-code Verified | 40 | 8K | Online | |||
| @cf/moonshotai/kimi-k2.6 Verified | 40 | 8K | Online | |||
| @cf/ibm-granite/granite-4.0-h-micro Verified | 40 | 8K | Online | |||
| @cf/baai/bge-small-en-v1.5 Verified | 40 | 8K | Online | |||
| @cf/zai-org/glm-5.2 Verified | 40 | 8K | Online | |||
| @cf/baai/bge-base-en-v1.5 Verified | 40 | 8K | Online | |||
| 40 | 8K | Online | ||||
| @cf/openai/gpt-oss-20b Verified | 40 | 8K | Online | |||
| 40 | 8K | Online | ||||
| gemini-robotics-er-1.6-preview Verified | 40 | 8K | Online | |||
| DeepSeek V4 Flash Verified | 40 | 1.0M | Online | |||
| Qwen/Qwen3-Next-80B-A3B-Instruct Verified Qwen: Qwen3 Next 80B A3B Instruct (free) | 40 | 8K | Online | |||
| Gemini 2.5 Flash-Lite Verified gemini-2.5-flash-lite | 40 | 1.0M | Online | |||
| meta/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free) | 40 | 131K | Up to 40 RPM | Online | ||
| meta/llama-3.1-70b-instruct | 40 | 131K | Community-powered, no hard cap | Online | ||
| Qwen/Qwen3-30B-A3B-Thinking-2507 Verified Qwen/Qwen3-30B-A3B-Thinking-2507 | 40 | 8K | Online | |||
| Qwen2.5-7B-Instruct | 40 | 131K | Credit-metered | Online | ||
| Pixtral Large | 40 | 128K | ~1 RPS, 500K TPM | Online | ||
| nvidia/embed-qa-4 Verified | 39 | 131K | Up to 40 RPM | Online | ||
| 39 | 131K | Up to 40 RPM | Online | |||
| nvidia/llama-3.2-nv-embedqa-1b-v1 Verified | 39 | 131K | Up to 40 RPM | Online | ||
| nvidia/nv-embed-v1 Verified | 39 | 131K | Up to 40 RPM | Online | ||
| nvidia/nv-embedcode-7b-v1 Verified | 39 | 131K | Up to 40 RPM | Online | ||
| nvidia/nv-embedqa-e5-v5 Verified | 39 | 131K | Up to 40 RPM | Online | ||
| nvidia/nv-embedqa-mistral-7b-v2 Verified | 39 | 131K | Up to 40 RPM | Online | ||
| snowflake/arctic-embed-l Verified | 39 | 131K | Up to 40 RPM | Online | ||
| meituan-longcat/LongCat-Flash-Lite Verified | 39 | 8K | Online | |||
| mistralai/Ministral-8B-Instruct-2410 Verified | 39 | 8K | Online | |||
| 39 | 8K | Online | ||||
| 39 | 8K | Online | ||||
| 39 | 8K | Online | ||||
| @cf/google/gemma-7b-it-lora Verified | 39 | 8K | Online | |||
| @cf/qwen/qwen2.5-coder-32b-instruct Verified qwen2.5-coder-32b | 39 | 8K | Online | |||
| meta/llama-3.1-70b-instruct | 39 | 131K | Unlimited for free models | Online | ||
| @cf/deepseek-ai/deepseek-r1-distill-qwen-32b | 39 | 8K | Online | |||
| Mistral Large (24.11) | 39 | 131K | See provider page | Online | ||
| meta/llama-3.2-1b-instruct Verified meta/llama-3.2-1b-instruct | 39 | 131K | Up to 40 RPM | Online | ||
| Qwen/Qwen3-VL-8B-Thinking Verified Qwen/Qwen3-VL-8B-Thinking | 39 | 8K | Online | |||
| mistralai/ministral-14b-instruct-2512 | 39 | 8K | Online | |||
| Qwen/Qwen3-235B-A22B Verified Qwen/Qwen3-235B-A22B | 39 | 8K | Online | |||
| Qwen/Qwen3-VL-8B-Instruct Verified Qwen/Qwen3-VL-8B-Instruct | 38 | 8K | Online | |||
| 37 | 131K | See provider page | Online | |||
| MiMo-V2.5 Verified | 37 | 1.0M | Online | |||
| Llama-3.3-70B-Instruct Verified Meta: Llama 3.3 70B Instruct (free) | 37 | 128K | Online | |||
| meta-llama/llama-3.3-70b-instruct Verified Meta: Llama 3.3 70B Instruct (free) | 37 | 8K | 200 req/day (free tier) | Online | ||
| mistral-small-3.1-24b | 36 | 8K | Online | |||
| 35 | 8K | Online | ||||
| nvidia/nvidia-nemotron-nano-9b-v2 Verified | 35 | 8K | Online | |||
| meta/llama-guard-4-12b Verified meta/llama-guard-4-12b | 35 | 164K | Up to 40 RPM | Online | ||
| mistralai/Mistral-Large-Instruct-2407 | 35 | 8K | Online | |||
| Qwen/Qwen3-30B-A3B Verified Qwen/Qwen3-30B-A3B | 35 | 8K | Online | |||
| North Mini Code Verified | 34 | 256K | Online | |||
| Llama-3.1-8B-Instruct | 34 | 128K | Credit-metered | Online | ||
| @cf/qwen/qwen3-30b-a3b-fp8 Verified Qwen/Qwen3-30B-A3B | 34 | 8K | Online | |||
| Qwen/Qwen3-8B Verified Qwen/Qwen3-8B | 34 | 8K | Online | |||
| PaddlePaddle/ERNIE-4.5-0.3B-PT Verified | 33 | 8K | Online | |||
| big-pickle Verified | 33 | N/A | Online | |||
| Meta: Llama 3.3 70B Instruct (free) | 33 | 8K | Online | |||
| meta-llama/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free) | 33 | 8K | 200 req/day (free tier) | Online | ||
| Qwen/Qwen3-14B Verified Qwen/Qwen3-14B | 33 | 8K | Online | |||
| 32 | 131K | $25/month free credits, resets monthly | Online | |||
| 32 | 128K | Online | ||||
| Nemotron 3 Content Safety Verified | 32 | 128K | Online | |||
| Nemotron Content Safety Reasoning 4B Verified | 32 | 128K | Online | |||
| Qwen/Qwen3-32B Verified qwen3-32b | 32 | 8K | Online | |||
| 30 | 8K | Online | ||||
| 30 | 8K | Online | ||||
| meta/llama-3.1-8b-instruct Verified Llama-3.1-8B-Instruct | 30 | 8K | Online | |||
| @cf/meta/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free) | 29 | 8K | Online | |||
| @cf/meta/llama-3.1-8b-instruct-fp8 Verified Llama-3.1-8B-Instruct | 28 | 8K | Online | |||
| @cf/meta/llama-guard-3-8b Verified | 27 | 8K | Online | |||
| @cf/qwen/qwen3-embedding-0.6b Verified | 27 | 8K | Online | |||
| @cf/pfnet/plamo-embedding-1b Verified | 27 | 8K | Online | |||
| @cf/google/embeddinggemma-300m Verified | 27 | 8K | Online | |||
| @cf/meta/llama-3.2-1b-instruct Verified meta/llama-3.2-1b-instruct | 27 | 8K | Online |
How to Get Started with Free LLM APIs
- Pick a free LLM model — Click any model name to see details, rate limits, and API key signup link.
- Get your API key — Sign up on the provider's website (most require no credit card).
- Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
- Test it — Use the Playground to test your API key before integrating.
New to LLM terminology? Check the 📖 Glossary — 22 terms explained in plain English →
FAQ: Common questions about free LLM APIs →About This Free LLM API Directory
Finding reliable free LLM API resources online can be frustrating. Many developers traditionally rely on static GitHub repositories to find endpoints. While those lists are a good starting point, they often become outdated quickly, leaving you with dead links, expired API keys, and unverified rate limits.
That's why we built this dynamic, auto-updating directory. If you are looking for a reliable alternative to GitHub free LLM API lists, this page tracks over 312 free LLM models online in real-time. Whether you need a free API key for text generation, vision, or coding tasks, you can compare context windows, capabilities, and strict rate limit data side-by-side.
Our goal is to be the most accurate and comprehensive list of free AI APIs for developers. Use the filters above to find providers that don't require credit cards or phone verification, and grab your free API keys to start building immediately.