Ollama Cloud — Free LLM API
6 free models available — credit card may be required. Get API key →
All Free Ollama Cloud Models — Context Windows & Rate Limits
| Model | Context | Max Output | Modality | Rate Limit | Status | |
|---|---|---|---|---|---|---|
| llama3.1:cloud | 128K | 131K | Session/weekly limits (unpublished) | Details | ||
| deepseek-r1:cloud | 128K | 131K | Session/weekly limits (unpublished) | Details | ||
| qwen2.5:cloud | 128K | 131K | Session/weekly limits (unpublished) | Details | ||
| gemma2:cloud | 8K | 131K | Session/weekly limits (unpublished) | Details | ||
| mistral:cloud | 32K | 131K | Session/weekly limits (unpublished) | Details | ||
| + 400 more models | 131K | 131K | Session/weekly limits (unpublished) | Details |
Frequently Asked Questions about Ollama Cloud Free API
Is Ollama Cloud free to use?
Ollama Cloud offers a permanently free tier with 6 available models. Account creation is required, and a credit card may be needed to activate the free tier.
What models does Ollama Cloud offer for free?
Ollama Cloud provides 6 free models covering chat, reasoning use cases. Supported modalities include text. Browse the full list above with context windows and rate limits.
How do I use Ollama Cloud with Claude Code or Cursor?
Click "Details" on any model above to get one-click configuration snippets for Claude Code (cc), Cursor, Codex, and more.
All Ollama Cloud models listed here use an OpenAI-compatible endpoint, so any tool that accepts a custom baseURL will work.