# ![LLM Gateway logo](https://models.dev/logos/llmgateway.svg) LLM Gateway

Access 199 LLM Gateway models through Mastra's model router. Authentication is handled automatically using the `LLMGATEWAY_API_KEY` environment variable.

Learn more in the [LLM Gateway documentation](https://llmgateway.io/docs).

```bash
LLMGATEWAY_API_KEY=your-api-key
```

```typescript
import { Agent } from "@mastra/core/agent";

const agent = new Agent({
  id: "my-agent",
  name: "My Agent",
  instructions: "You are a helpful assistant",
  model: "llmgateway/auto"
});

// Generate a response
const response = await agent.generate("Hello!");

// Stream a response
const stream = await agent.stream("Tell me a story");
for await (const chunk of stream) {
  console.log(chunk);
}
```

> **Info:** Mastra uses the OpenAI-compatible `/chat/completions` endpoint. Some provider-specific features may not be available. Check the [LLM Gateway documentation](https://llmgateway.io/docs) for details.

## Models

| Model | Context | Tools | Reasoning | Image | Audio | Video | Input $/1M | Output $/1M |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| `llmgateway/auto` | 128K | | | | | | — | — |
| `llmgateway/claude-3-5-sonnet` | 200K | | | | | | $3 | $15 |
| `llmgateway/claude-3-7-sonnet` | 200K | | | | | | $3 | $15 |
| `llmgateway/claude-3-7-sonnet-20250219` | 200K | | | | | | $3 | $15 |
| `llmgateway/claude-3-haiku` | 200K | | | | | | $0.25 | $1 |
| `llmgateway/claude-3-haiku-20240307` | 200K | | | | | | $0.25 | $1 |
| `llmgateway/claude-3-opus` | 200K | | | | | | $15 | $75 |
| `llmgateway/claude-haiku-4-5` | 200K | | | | | | $1 | $5 |
| `llmgateway/claude-haiku-4-5-20251001` | 200K | | | | | | $1 | $5 |
| `llmgateway/claude-opus-4-1-20250805` | 200K | | | | | | $15 | $75 |
| `llmgateway/claude-opus-4-20250514` | 200K | | | | | | $15 | $75 |
| `llmgateway/claude-opus-4-5-20251101` | 200K | | | | | | $5 | $25 |
| `llmgateway/claude-opus-4-6` | 1.0M | | | | | | $5 | $25 |
| `llmgateway/claude-sonnet-4-20250514` | 200K | | | | | | $3 | $15 |
| `llmgateway/claude-sonnet-4-5` | 200K | | | | | | $3 | $15 |
| `llmgateway/claude-sonnet-4-5-20250929` | 200K | | | | | | $3 | $15 |
| `llmgateway/claude-sonnet-4-6` | 200K | | | | | | $3 | $15 |
| `llmgateway/codestral-2508` | 256K | | | | | | $0.30 | $0.90 |
| `llmgateway/cogview-4` | 2K | | | | | | — | — |
| `llmgateway/custom` | 128K | | | | | | — | — |
| `llmgateway/deepseek-r1-0528` | 64K | | | | | | $0.80 | $2 |
| `llmgateway/deepseek-v3.1` | 128K | | | | | | $0.56 | $2 |
| `llmgateway/deepseek-v3.2` | 164K | | | | | | $0.28 | $0.42 |
| `llmgateway/devstral-2512` | 262K | | | | | | $0.40 | $2 |
| `llmgateway/devstral-small-2507` | 131K | | | | | | $0.10 | $0.30 |
| `llmgateway/gemini-2.5-flash` | 1.0M | | | | | | $0.30 | $3 |
| `llmgateway/gemini-2.5-flash-image` | 33K | | | | | | $0.30 | $30 |
| `llmgateway/gemini-2.5-flash-image-preview` | 33K | | | | | | $0.30 | $3 |
| `llmgateway/gemini-2.5-flash-lite` | 1.0M | | | | | | $0.10 | $0.40 |
| `llmgateway/gemini-2.5-flash-lite-preview-09-2025` | 1.0M | | | | | | $0.10 | $0.40 |
| `llmgateway/gemini-2.5-pro` | 1.0M | | | | | | $1 | $10 |
| `llmgateway/gemini-3-flash-preview` | 1.0M | | | | | | $0.50 | $3 |
| `llmgateway/gemini-3-pro-image-preview` | 66K | | | | | | $2 | $12 |
| `llmgateway/gemini-3.1-flash-image-preview` | 66K | | | | | | $0.25 | $2 |
| `llmgateway/gemini-3.1-flash-lite-preview` | 1.0M | | | | | | $0.25 | $2 |
| `llmgateway/gemini-3.1-pro-preview` | 1.0M | | | | | | $2 | $12 |
| `llmgateway/gemini-pro-latest` | 1.0M | | | | | | $2 | $12 |
| `llmgateway/gemma-2-27b-it-together` | 8K | | | | | | $0.08 | $0.08 |
| `llmgateway/gemma-3-12b-it` | 1.0M | | | | | | $0.08 | $0.30 |
| `llmgateway/gemma-3-1b-it` | 1.0M | | | | | | $0.08 | $0.30 |
| `llmgateway/gemma-3-27b` | 128K | | | | | | $0.27 | $0.27 |
| `llmgateway/gemma-3-4b-it` | 1.0M | | | | | | $0.08 | $0.30 |
| `llmgateway/gemma-3n-e2b-it` | 1.0M | | | | | | $0.08 | $0.30 |
| `llmgateway/gemma-3n-e4b-it` | 1.0M | | | | | | $0.08 | $0.30 |
| `llmgateway/glm-4-32b-0414-128k` | 128K | | | | | | $0.10 | $0.10 |
| `llmgateway/glm-4.5` | 128K | | | | | | $0.60 | $2 |
| `llmgateway/glm-4.5-air` | 128K | | | | | | $0.20 | $1 |
| `llmgateway/glm-4.5-airx` | 128K | | | | | | $1 | $5 |
| `llmgateway/glm-4.5-flash` | 128K | | | | | | — | — |
| `llmgateway/glm-4.5-x` | 128K | | | | | | $2 | $9 |
| `llmgateway/glm-4.5v` | 128K | | | | | | $0.60 | $2 |
| `llmgateway/glm-4.6` | 200K | | | | | | $0.60 | $2 |
| `llmgateway/glm-4.6v` | 128K | | | | | | $0.30 | $0.90 |
| `llmgateway/glm-4.6v-flash` | 128K | | | | | | — | — |
| `llmgateway/glm-4.6v-flashx` | 128K | | | | | | $0.04 | $0.40 |
| `llmgateway/glm-4.7` | 200K | | | | | | $0.60 | $2 |
| `llmgateway/glm-4.7-flash` | 200K | | | | | | — | — |
| `llmgateway/glm-4.7-flashx` | 200K | | | | | | $0.07 | $0.40 |
| `llmgateway/glm-5` | 203K | | | | | | $1 | $3 |
| `llmgateway/glm-image` | 2K | | | | | | — | — |
| `llmgateway/gpt-3.5-turbo` | 16K | | | | | | $0.50 | $2 |
| `llmgateway/gpt-4` | 8K | | | | | | $30 | $60 |
| `llmgateway/gpt-4-turbo` | 128K | | | | | | $10 | $30 |
| `llmgateway/gpt-4.1` | 1.0M | | | | | | $2 | $8 |
| `llmgateway/gpt-4.1-mini` | 1.0M | | | | | | $0.40 | $2 |
| `llmgateway/gpt-4.1-nano` | 1.0M | | | | | | $0.10 | $0.40 |
| `llmgateway/gpt-4o` | 128K | | | | | | $3 | $10 |
| `llmgateway/gpt-4o-mini` | 128K | | | | | | $0.15 | $0.60 |
| `llmgateway/gpt-4o-mini-search-preview` | 128K | | | | | | $0.15 | $0.60 |
| `llmgateway/gpt-4o-search-preview` | 128K | | | | | | $3 | $10 |
| `llmgateway/gpt-5` | 400K | | | | | | $1 | $10 |
| `llmgateway/gpt-5-chat-latest` | 400K | | | | | | $1 | $10 |
| `llmgateway/gpt-5-mini` | 400K | | | | | | $0.25 | $2 |
| `llmgateway/gpt-5-nano` | 400K | | | | | | $0.05 | $0.40 |
| `llmgateway/gpt-5-pro` | 400K | | | | | | $15 | $120 |
| `llmgateway/gpt-5.1` | 400K | | | | | | $1 | $10 |
| `llmgateway/gpt-5.1-codex` | 400K | | | | | | $1 | $10 |
| `llmgateway/gpt-5.1-codex-mini` | 400K | | | | | | $0.25 | $2 |
| `llmgateway/gpt-5.2` | 400K | | | | | | $2 | $14 |
| `llmgateway/gpt-5.2-chat-latest` | 128K | | | | | | $2 | $14 |
| `llmgateway/gpt-5.2-codex` | 400K | | | | | | $2 | $14 |
| `llmgateway/gpt-5.2-pro` | 400K | | | | | | $21 | $168 |
| `llmgateway/gpt-5.3-chat-latest` | 128K | | | | | | $2 | $14 |
| `llmgateway/gpt-5.3-codex` | 400K | | | | | | $2 | $14 |
| `llmgateway/gpt-5.4` | 1.1M | | | | | | $3 | $15 |
| `llmgateway/gpt-5.4-mini` | 400K | | | | | | $0.75 | $5 |
| `llmgateway/gpt-5.4-nano` | 400K | | | | | | $0.20 | $1 |
| `llmgateway/gpt-5.4-pro` | 1.1M | | | | | | $30 | $180 |
| `llmgateway/gpt-oss-120b` | 131K | | | | | | $0.15 | $0.75 |
| `llmgateway/gpt-oss-20b` | 131K | | | | | | $0.10 | $0.50 |
| `llmgateway/grok-3` | 131K | | | | | | $3 | $15 |
| `llmgateway/grok-4` | 256K | | | | | | $3 | $15 |
| `llmgateway/grok-4-0709` | 256K | | | | | | $3 | $15 |
| `llmgateway/grok-4-1-fast` | 2.0M | | | | | | $0.20 | $0.50 |
| `llmgateway/grok-4-1-fast-non-reasoning` | 2.0M | | | | | | $0.20 | $0.50 |
| `llmgateway/grok-4-1-fast-reasoning` | 2.0M | | | | | | $0.20 | $0.50 |
| `llmgateway/grok-4-20-beta-0309-non-reasoning` | 2.0M | | | | | | $2 | $6 |
| `llmgateway/grok-4-20-beta-0309-reasoning` | 2.0M | | | | | | $2 | $6 |
| `llmgateway/grok-4-20-multi-agent-beta-0309` | 2.0M | | | | | | $2 | $6 |
| `llmgateway/grok-4-fast` | 2.0M | | | | | | $0.20 | $0.50 |
| `llmgateway/grok-4-fast-non-reasoning` | 2.0M | | | | | | $0.20 | $0.50 |
| `llmgateway/grok-4-fast-reasoning` | 2.0M | | | | | | $0.20 | $0.50 |
| `llmgateway/grok-code-fast-1` | 256K | | | | | | $0.20 | $2 |
| `llmgateway/grok-imagine-image` | 2K | | | | | | — | — |
| `llmgateway/grok-imagine-image-pro` | 2K | | | | | | — | — |
| `llmgateway/hermes-2-pro-llama-3-8b` | 8K | | | | | | $0.14 | $0.14 |
| `llmgateway/kimi-k2` | 131K | | | | | | $1 | $3 |
| `llmgateway/kimi-k2-thinking` | 262K | | | | | | $0.60 | $3 |
| `llmgateway/kimi-k2-thinking-turbo` | 262K | | | | | | $1 | $8 |
| `llmgateway/kimi-k2.5` | 262K | | | | | | $0.60 | $3 |
| `llmgateway/llama-3-70b-instruct` | 8K | | | | | | $0.51 | $0.74 |
| `llmgateway/llama-3-8b-instruct` | 8K | | | | | | $0.04 | $0.04 |
| `llmgateway/llama-3.1-70b-instruct` | 128K | | | | | | $0.72 | $0.72 |
| `llmgateway/llama-3.1-8b-instruct` | 128K | | | | | | $0.22 | $0.22 |
| `llmgateway/llama-3.1-nemotron-ultra-253b` | 128K | | | | | | $0.60 | $2 |
| `llmgateway/llama-3.2-11b-instruct` | 128K | | | | | | $0.07 | $0.33 |
| `llmgateway/llama-3.2-3b-instruct` | 33K | | | | | | $0.03 | $0.05 |
| `llmgateway/llama-3.3-70b-instruct` | 128K | | | | | | $0.13 | $0.40 |
| `llmgateway/llama-4-maverick-17b-instruct` | 8K | | | | | | $0.24 | $0.97 |
| `llmgateway/llama-4-scout` | 33K | | | | | | $0.18 | $0.59 |
| `llmgateway/llama-4-scout-17b-instruct` | 8K | | | | | | $0.17 | $0.66 |
| `llmgateway/llama-guard-4-12b` | 131K | | | | | | $0.20 | $0.20 |
| `llmgateway/minimax-m2` | 197K | | | | | | $0.20 | $1 |
| `llmgateway/minimax-m2.1` | 197K | | | | | | $0.27 | $1 |
| `llmgateway/minimax-m2.1-lightning` | 197K | | | | | | $0.12 | $0.48 |
| `llmgateway/minimax-m2.5` | 205K | | | | | | $0.30 | $1 |
| `llmgateway/minimax-m2.5-highspeed` | 205K | | | | | | $0.60 | $2 |
| `llmgateway/minimax-m2.7` | 205K | | | | | | $0.30 | $1 |
| `llmgateway/minimax-m2.7-highspeed` | 205K | | | | | | $0.60 | $2 |
| `llmgateway/minimax-text-01` | 1.0M | | | | | | $0.20 | $1 |
| `llmgateway/ministral-14b-2512` | 262K | | | | | | $0.20 | $0.20 |
| `llmgateway/ministral-3b-2512` | 131K | | | | | | $0.10 | $0.10 |
| `llmgateway/ministral-8b-2512` | 262K | | | | | | $0.15 | $0.15 |
| `llmgateway/mistral-large-2512` | 262K | | | | | | $0.50 | $2 |
| `llmgateway/mistral-large-latest` | 128K | | | | | | $4 | $12 |
| `llmgateway/mistral-small-2506` | 128K | | | | | | $0.10 | $0.30 |
| `llmgateway/mixtral-8x7b-instruct-together` | 33K | | | | | | $0.06 | $0.06 |
| `llmgateway/o1` | 200K | | | | | | $15 | $60 |
| `llmgateway/o3` | 200K | | | | | | $2 | $8 |
| `llmgateway/o3-mini` | 200K | | | | | | $1 | $4 |
| `llmgateway/o4-mini` | 200K | | | | | | $1 | $4 |
| `llmgateway/pixtral-large-latest` | 128K | | | | | | $4 | $12 |
| `llmgateway/qwen-coder-plus` | 131K | | | | | | $1 | $5 |
| `llmgateway/qwen-flash` | 1.0M | | | | | | $0.05 | $0.40 |
| `llmgateway/qwen-image` | 2K | | | | | | — | — |
| `llmgateway/qwen-image-edit-max` | 2K | | | | | | — | — |
| `llmgateway/qwen-image-edit-plus` | 2K | | | | | | — | — |
| `llmgateway/qwen-image-max` | 2K | | | | | | — | — |
| `llmgateway/qwen-image-max-2025-12-30` | 2K | | | | | | — | — |
| `llmgateway/qwen-image-plus` | 2K | | | | | | — | — |
| `llmgateway/qwen-max` | 131K | | | | | | $2 | $6 |
| `llmgateway/qwen-max-latest` | 131K | | | | | | $2 | $6 |
| `llmgateway/qwen-omni-turbo` | 33K | | | | | | $0.20 | $0.80 |
| `llmgateway/qwen-plus` | 131K | | | | | | $0.40 | $1 |
| `llmgateway/qwen-plus-latest` | 1.0M | | | | | | $0.40 | $1 |
| `llmgateway/qwen-turbo` | 1.0M | | | | | | $0.05 | $0.20 |
| `llmgateway/qwen-vl-max` | 131K | | | | | | $0.80 | $3 |
| `llmgateway/qwen-vl-plus` | 131K | | | | | | $0.21 | $0.64 |
| `llmgateway/qwen2-5-vl-32b-instruct` | 131K | | | | | | $1 | $4 |
| `llmgateway/qwen2-5-vl-72b-instruct` | 33K | | | | | | $0.13 | $0.40 |
| `llmgateway/qwen25-coder-7b` | 33K | | | | | | $0.01 | $0.03 |
| `llmgateway/qwen3-235b-a22b-fp8` | 41K | | | | | | $0.20 | $0.80 |
| `llmgateway/qwen3-235b-a22b-instruct-2507` | 262K | | | | | | $0.20 | $0.60 |
| `llmgateway/qwen3-235b-a22b-thinking-2507` | 262K | | | | | | $0.20 | $0.60 |
| `llmgateway/qwen3-30b-a3b-fp8` | 41K | | | | | | $0.09 | $0.45 |
| `llmgateway/qwen3-30b-a3b-instruct-2507` | 262K | | | | | | $0.10 | $0.30 |
| `llmgateway/qwen3-30b-a3b-thinking-2507` | 262K | | | | | | $0.10 | $0.30 |
| `llmgateway/qwen3-32b` | 33K | | | | | | $0.10 | $0.30 |
| `llmgateway/qwen3-32b-fp8` | 41K | | | | | | $0.10 | $0.45 |
| `llmgateway/qwen3-4b-fp8` | 128K | | | | | | $0.03 | $0.03 |
| `llmgateway/qwen3-coder-30b-a3b-instruct` | 262K | | | | | | $0.10 | $0.30 |
| `llmgateway/qwen3-coder-480b-a35b-instruct` | 262K | | | | | | $0.40 | $2 |
| `llmgateway/qwen3-coder-flash` | 1.0M | | | | | | $0.30 | $2 |
| `llmgateway/qwen3-coder-next` | 262K | | | | | | $0.11 | $0.68 |
| `llmgateway/qwen3-coder-plus` | 1.0M | | | | | | $6 | $60 |
| `llmgateway/qwen3-max` | 256K | | | | | | $3 | $15 |
| `llmgateway/qwen3-max-2026-01-23` | 262K | | | | | | $1 | $6 |
| `llmgateway/qwen3-next-80b-a3b-instruct` | 129K | | | | | | $0.50 | $2 |
| `llmgateway/qwen3-next-80b-a3b-thinking` | 131K | | | | | | $0.50 | $6 |
| `llmgateway/qwen3-vl-235b-a22b-instruct` | 131K | | | | | | $0.50 | $2 |
| `llmgateway/qwen3-vl-235b-a22b-thinking` | 131K | | | | | | $0.50 | $2 |
| `llmgateway/qwen3-vl-30b-a3b-instruct` | 131K | | | | | | $0.20 | $0.70 |
| `llmgateway/qwen3-vl-30b-a3b-thinking` | 131K | | | | | | $0.20 | $1 |
| `llmgateway/qwen3-vl-8b-instruct` | 131K | | | | | | $0.08 | $0.50 |
| `llmgateway/qwen3-vl-flash` | 262K | | | | | | $0.05 | $0.40 |
| `llmgateway/qwen3-vl-plus` | 262K | | | | | | $0.20 | $2 |
| `llmgateway/qwen35-397b-a17b` | 262K | | | | | | $0.60 | $4 |
| `llmgateway/qwq-plus` | 131K | | | | | | $0.80 | $2 |
| `llmgateway/seed-1-6-250615` | 256K | | | | | | $0.25 | $2 |
| `llmgateway/seed-1-6-250915` | 256K | | | | | | $0.25 | $2 |
| `llmgateway/seed-1-6-flash-250715` | 256K | | | | | | $0.07 | $0.30 |
| `llmgateway/seed-1-8-251228` | 256K | | | | | | $0.25 | $2 |
| `llmgateway/seedream-4-0` | 2K | | | | | | — | — |
| `llmgateway/seedream-4-5` | 2K | | | | | | — | — |
| `llmgateway/sonar` | 130K | | | | | | $1 | $1 |
| `llmgateway/sonar-pro` | 200K | | | | | | $3 | $15 |
| `llmgateway/sonar-reasoning-pro` | 128K | | | | | | $2 | $8 |
| `llmgateway/veo-3.1-fast-generate-preview` | 33K | | | | | | — | — |
| `llmgateway/veo-3.1-generate-preview` | 33K | | | | | | — | — |

## Advanced configuration

### Custom headers

```typescript
import { Agent } from "@mastra/core/agent";

const agent = new Agent({
  id: "custom-agent",
  name: "custom-agent",
  model: {
    url: "https://api.llmgateway.io/v1",
    id: "llmgateway/auto",
    apiKey: process.env.LLMGATEWAY_API_KEY,
    // Extra headers sent with each request to the gateway
    headers: {
      "X-Custom-Header": "value"
    }
  }
});
```

### Dynamic model selection

```typescript
import { Agent } from "@mastra/core/agent";

const agent = new Agent({
  id: "dynamic-agent",
  name: "Dynamic Agent",
  // The model is resolved per request from the request context
  model: ({ requestContext }) => {
    const useAdvanced = requestContext.task === "complex";
    return useAdvanced
      ? "llmgateway/veo-3.1-generate-preview"
      : "llmgateway/auto";
  }
});
```
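
When more than two models are in play, the selection logic in the callback above can be easier to maintain as a small lookup with a fallback. The following is a minimal sketch, not part of Mastra or LLM Gateway: the `TaskKind` labels, the `MODEL_BY_TASK` map, and the `selectModel` helper are hypothetical names, and the pairing of tasks to models is purely illustrative. The model IDs themselves are taken from the Models table.

```typescript
// Hypothetical task labels; replace with whatever your application tracks.
type TaskKind = "simple" | "coding" | "long-context";

// Fallback used when the task is missing or not recognized.
const DEFAULT_MODEL = "llmgateway/auto";

// Illustrative mapping only; the IDs come from the Models table.
const MODEL_BY_TASK: Record<TaskKind, string> = {
  simple: "llmgateway/gpt-4o-mini",
  coding: "llmgateway/codestral-2508",
  "long-context": "llmgateway/gemini-2.5-pro",
};

// Returns a model ID for a known task label, otherwise the default.
function selectModel(task: unknown): string {
  if (
    typeof task === "string" &&
    Object.prototype.hasOwnProperty.call(MODEL_BY_TASK, task)
  ) {
    return MODEL_BY_TASK[task as TaskKind];
  }
  return DEFAULT_MODEL;
}

console.log(selectModel("coding")); // prints "llmgateway/codestral-2508"
console.log(selectModel("unknown-task")); // prints "llmgateway/auto"
```

A helper like this could then be called from the `model` callback, for example `model: ({ requestContext }) => selectModel(requestContext.task)`, assuming the task label is available on the request context as in the example above. Keeping the mapping in one place means unknown or missing task values always fall back to `llmgateway/auto`.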