Gemini 2.5 Flash — Free API

Created by Google ⭐ Score: 52
google-gemini/gemini-2-5-flash
🛠️ Function Calling JSON Mode chat

What is Gemini 2.5 Flash?

The Google Gemini 2.5 Flash is a powerful 2-3 sentence LLM model ideal for chat applications, generating up to 65,000 tokens from a 1 million token context. Developers can utilize its capability for conversational AI without requiring a credit card. (Practical note: Note the 10 RPM, 250 RPD rate limit.)

Model ID
gemini-2-5-flash
Base URL
https://generativelanguage.googleapis.com/v1beta

Gemini 2.5 Flash API Code Example

Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.

from openai import OpenAI

client = OpenAI(
    base_url="https://generativelanguage.googleapis.com/v1beta",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="gemini-2-5-flash",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://generativelanguage.googleapis.com/v1beta",
  apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
  model: "gemini-2-5-flash",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);
curl https://generativelanguage.googleapis.com/v1beta/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "gemini-2-5-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Other Free Models from Google Gemini

More About Google Gemini

How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Google Gemini as a free LLM API backend.

View Google Gemini full guide →