Ollama Cloud — Free LLM API

6 free models available — credit card may be required. Get API key →

All Free Ollama Cloud Models — Context Windows & Rate Limits

Model Context Max Output Modality Rate Limit Status
llama3.1:cloud 128K 131K text Session/weekly limits (unpublished) Details
deepseek-r1:cloud 128K 131K text Session/weekly limits (unpublished) Details
qwen2.5:cloud 128K 131K text Session/weekly limits (unpublished) Details
gemma2:cloud 8K 131K text Session/weekly limits (unpublished) Details
mistral:cloud 32K 131K text Session/weekly limits (unpublished) Details
+ 400 more models 131K 131K text Session/weekly limits (unpublished) Details

Frequently Asked Questions about Ollama Cloud Free API

Is Ollama Cloud free to use?

Ollama Cloud offers a permanently free tier with 6 available models. Account creation is required, and a credit card may be needed to activate the free tier.

What models does Ollama Cloud offer for free?

Ollama Cloud provides 6 free models covering chat, reasoning use cases. Supported modalities include text. Browse the full list above with context windows and rate limits.

How do I use Ollama Cloud with Claude Code or Cursor?

Click "Details" on any model above to get one-click configuration snippets for Claude Code (cc), Cursor, Codex, and more. All Ollama Cloud models listed here use an OpenAI-compatible endpoint, so any tool that accepts a custom baseURL will work.