qwen-3-235b-a22b-instruct-2507 — Free API
Created by Alibabacerebras/qwen-3-235b-a22b-instruct-2507 What is qwen-3-235b-a22b-instruct-2507?
Qwen3-235B-A22B is Alibaba's massive 235B-parameter MoE model available for free on Cerebras Cloud's ultra-fast WSE inference hardware. With 131K context and OpenAI-compatible API, it delivers strong general-purpose chat performance at speeds far exceeding GPU-based alternatives — Cerebras' wafer-scale engine means responses stream near-instantly even at this parameter scale. The free tier allows 14,400 requests per day at 30 RPM, making it viable for moderate production workloads. No credit card is required; the main trade-off versus running this model on other providers is the 8K output limit and lower RPM compared to Groq's Llama endpoints.
qwen-3-235b-a22b-instruct-2507 API Code Example
Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.
Other Free Models from Cerebras
More About Cerebras
How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Cerebras as a free LLM API backend.
View Cerebras full guide →