Available

+ API-Inference-enabled models

+ API-Inference-enabled models — free model from ModelScope.

+ API-Inference-enabled models — Free API Specifications

Context 131K
Max Output 131K
Modality text
Rate Limit Dynamic quotas + dynamic concurrency
Card Required Yes
OpenAI Compatible No

How to Configure + API-Inference-enabled models for Free

Base URL https://api-inference.modelscope.cn/v1
How to get an API key Get API Key →

One-Click Config for Claude Code, Cursor & More

Claude Code

# Claude Code works via OpenRouter's Anthropic-compatible API.
# Note: Only paid Anthropic Claude models are supported (e.g. claude-sonnet-4.6, claude-opus-4).
# Browse available Claude models at: https://openrouter.ai/models?q=anthropic

# Add to ~/.zshrc or ~/.bashrc
export OPENROUTER_API_KEY="<your-openrouter-api-key>"  # Get at https://openrouter.ai/settings/keys
export ANTHROPIC_BASE_URL="https://openrouter.ai/api"
export ANTHROPIC_AUTH_TOKEN="$OPENROUTER_API_KEY"
export ANTHROPIC_API_KEY=""  # Must be explicitly empty to avoid conflicts

# Optional: pin specific models for each role
# export ANTHROPIC_DEFAULT_SONNET_MODEL="anthropic/claude-sonnet-4.6"
# export ANTHROPIC_DEFAULT_HAIKU_MODEL="anthropic/claude-haiku-4.5"

# Then simply run: claude

Cursor

# Cursor → Settings (⚙️) → Models → Add Model
# Enter the model name exactly as shown, then fill in:
#   Override OpenAI Base URL: https://api-inference.modelscope.cn/v1
#   OpenAI API Key: <your-api-key>   # Get at https://modelscope.cn/my/myaccesstoken
# Click "Verify" to confirm the connection, then enable the model.
#
# Model name to add: + API-Inference-enabled models

Codex

# Add to ~/.zshrc or ~/.bashrc
export OPENAI_BASE_URL="https://api-inference.modelscope.cn/v1"
export OPENAI_API_KEY="<your-api-key>"  # Get at https://modelscope.cn/my/myaccesstoken

# Then run:
codex --model "+ API-Inference-enabled models"

Gemini CLI

# ~/.gemini/settings.json
{
  "apiKey": "<your-api-key>",
  "model": "+ API-Inference-enabled models"
}
# Get API key at https://modelscope.cn/my/myaccesstoken

OpenCode

// ~/.config/opencode/opencode.json
{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "free-llm": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "Free LLM",
      "options": {
        "baseURL": "https://api-inference.modelscope.cn/v1",
        "apiKey": "<your-api-key>"
      },
      "models": {
        "+ API-Inference-enabled models": { "name": "+ API-Inference-enabled models" }
      }
    }
  }
}
// Get API key at https://modelscope.cn/my/myaccesstoken

Hermes

# Step 1 — Edit config.yaml
# Windows: C:\Users\<you>\AppData\Local\hermes\config.yaml
# macOS/Linux: ~/.config/hermes/config.yaml

model:
  default: + API-Inference-enabled models
  provider: custom
  base_url: ${CUSTOM_BASE_URL}
  api_key: ${CUSTOM_API_KEY}
  model_aliases:
    + API-Inference-enabled models:
      model: "+ API-Inference-enabled models"
      provider: "custom"

# Step 2 — Edit .env (same directory as config.yaml)
# Windows: C:\Users\<you>\AppData\Local\hermes\.env
# macOS/Linux: ~/.config/hermes/.env

# ========================
# Custom API (OpenAI-compatible)
# ========================
CUSTOM_API_KEY=<your-api-key>        # Get at https://modelscope.cn/my/myaccesstoken
CUSTOM_BASE_URL=https://api-inference.modelscope.cn/v1

OpenClaw

// ~/.openclaw/openclaw.json  (JSON5 format)
{
  "agents": {
    "defaults": {
      "model": {
        "primary": "+ API-Inference-enabled models",
      },
    },
  },
  "models": {
    "providers": {
      // Option A — Built-in provider (OpenAI, Anthropic, Google…)
      // Just add apiKey; OpenClaw handles the baseUrl automatically
      // "openai": { "apiKey": "<your-api-key>" },

      // Option B — Custom OpenAI-compatible base URL (e.g. OpenRouter, NVIDIA)
      "free-llm": {
        "baseUrl": "https://api-inference.modelscope.cn/v1",
        "apiKey": "<your-api-key>",  // Get at https://modelscope.cn/my/myaccesstoken
        "api": "openai-completions", // openai-completions | anthropic-messages | …
        "models": [
          { "id": "+ API-Inference-enabled models", "name": "+ API-Inference-enabled models" },
        ],
      },
    },
  },
}
// Apply: openclaw gateway restart
// Verify: openclaw doctor --fix

Frequently Asked Questions about + API-Inference-enabled models

Is + API-Inference-enabled models free to use?

Yes. + API-Inference-enabled models is available on a permanently free tier via ModelScope. A credit card may be required to activate the free tier. The free tier includes a rate limit of Dynamic quotas + dynamic concurrency.

What is + API-Inference-enabled models best for?

+ API-Inference-enabled models is optimized for chat tasks. It supports text modalities, with a context window of 131K tokens and a maximum output of 131K tokens. + API-Inference-enabled models — free model from ModelScope.

Is + API-Inference-enabled models OpenAI-compatible?

+ API-Inference-enabled models does not use a standard OpenAI-compatible endpoint. Please refer to the ModelScope documentation for SDK integration details.

How do I get an API key for + API-Inference-enabled models?

Visit ModelScope's API key page to register and generate a free API key. Once you have the key, use the configuration snippets above to set up Claude Code, Cursor, or your preferred AI coding tool.