@cf/meta/llama-4-scout-17b-16e-instruct — Free API
⭐ Score: 51cloudflare-workers-ai/cf-meta-llama-4-scout-17b-16e-instruct What is @cf/meta/llama-4-scout-17b-16e-instruct?
Llama 4 Scout 17B is Meta's latest-generation small-footprint model with an exceptionally large 10M-token context window — meaning you can ingest entire codebases, multi-hour transcripts, or thousand-page documents in a single request. Running on Cloudflare's global edge network, it combines the efficiency of a 17B-parameter model (with 16 active experts via MoE) with a context size that exceeds most flagship models. On the free tier it shares the 10,000 Neurons/day pool and uses Cloudflare's native API. For developers evaluating long-context architectures or building retrieval-free RAG alternatives, this is one of the few free options that can genuinely test 10M-token prompts.
@cf/meta/llama-4-scout-17b-16e-instruct API Code Example
Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.
Other Free Models from Cloudflare Workers AI
More About Cloudflare Workers AI
How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Cloudflare Workers AI as a free LLM API backend.
View Cloudflare Workers AI full guide →