Code mode

Added in: @mastra/core@1.38.0

beta

This feature is in beta. Breaking changes may occur without a major version bump until the API is stable.

Code mode lets an agent run multi-tool computations in an isolated sandbox and return the result as a single, more accurate response.

Instead of calling tools one turn at a time, the model writes a tailored function for the user query. The function orchestrates your existing tools as external_* functions and reduces or aggregates their results into one structured answer.

createCodeMode() returns this tool with the default id of execute_typescript. The id is configurable, so an agent can have several code mode tools at once, each scoped to a different set of tools (see Scoping tools across multiple code tools).

When to use code mode
Direct link to When to use code mode

Use code mode when an agent touches multiple tools to answer a user query or perform a complex computation:

Fewer round-trips: A multi-tool query runs in one tool call instead of repeating the agentic loop for every tool decision.
Smaller context: The function can reduce or aggregate large tool responses before returning them to the agent.
Correct math: Sums, averages, and other arithmetic run as JavaScript, not as token prediction.
Planning up front: Filtering, aggregation, and branching happen inside the function rather than across separate turns.

How it works
Direct link to How it works

Without code mode, multi-tool queries can run the agentic loop several times. The model chooses a tool and reads the result. It repeats the process as needed.

Each turn adds the full tool response to the agent's context window which can lead to poorer reasoning and increased token usage.

With code mode, your tools keep running on the host with full validation, request context, and tracing. Only the model's orchestration code runs in the sandbox. Each external_* call is bridged back to the real tool on the host, and the function can reduce or aggregate results before returning one response to the agent.

The function runs in a Workspace sandbox. A sandbox is required, because code mode runs model-authored code and the execution boundary must be chosen deliberately. Pass one via sandbox, or run the agent in a workspace that provides one. To execute on the host machine, pass new LocalSandbox() explicitly. This runs the function as a host node process with host privileges, so only use it for trusted or local development.

Transports that bring their own execution boundary are the exception: with IsolatedVmCodeModeTransport the program runs in an in-process V8 isolate and no sandbox is needed (see In-process isolation).

Quickstart
Direct link to Quickstart

createCodeMode() returns the tool plus generated instructions. With no id, the tool uses the name execute_typescript. Add both to your agent:

import { Agent } from '@mastra/core/agent'
import { createCodeMode, createTool } from '@mastra/core/tools'
import { LocalSandbox } from '@mastra/core/workspace'
import { z } from 'zod'

const getTopProducts = createTool({
  id: 'getTopProducts',
  description: 'Get top selling products',
  inputSchema: z.object({ limit: z.number() }),
  outputSchema: z.object({
    products: z.array(z.object({ id: z.string(), name: z.string(), totalSales: z.number() })),
  }),
  execute: async ({ limit }) => fetchTopProducts(limit),
})

const getProductRatings = createTool({
  id: 'getProductRatings',
  description: 'Get ratings for a product',
  inputSchema: z.object({ productId: z.string() }),
  outputSchema: z.object({ ratings: z.array(z.object({ score: z.number() })) }),
  execute: async ({ productId }) => fetchRatings(productId),
})

const { tool, instructions } = createCodeMode({
  tools: { getTopProducts, getProductRatings },
  sandbox: new LocalSandbox(), // required; runs on the host — see "How it works"
})

const agent = new Agent({
  id: 'shop-assistant',
  name: 'shop-assistant',
  instructions: ['You are a helpful shopping assistant.', instructions],
  model: 'openai/gpt-5.6-sol',
  tools: { execute_typescript: tool },
})

Asked "What are the top 5 products and the average rating for each?", the model emits one execute_typescript call instead of many separate tool calls:

const top = await external_getTopProducts({ limit: 5 })
const ratings = await Promise.all(
  top.products.map(p => external_getProductRatings({ productId: p.id })),
)
return top.products.map((product, i) => {
  const scores = ratings[i].ratings.map(r => r.score)
  const avg = scores.reduce((sum, s) => sum + s, 0) / scores.length
  return {
    name: product.name,
    sales: product.totalSales,
    averageRating: Math.round(avg * 100) / 100,
  }
})

All five rating lookups run in parallel, the averages are computed in JavaScript, and the agent receives one structured result.

For createCodeMode() to work well, keep these tips in mind:

Keep tools focused, so each does one thing well and the model composes them in code.
Code mode helps most when calls can be parallelized with Promise.all.

Visit the createCodeMode() reference for configuration options, return values, result shape, and instructions inspection.

Scoping tools across multiple code tools
Direct link to Scoping tools across multiple code tools

createCodeMode() captures its own allow-list. Call it more than once to give an agent several code tools, each scoped to a different subset of tools. It can only call the external_* functions for the tools passed to its own createCodeMode() call, so the subsets stay isolated.

Give each tool a distinct id so their ids don't collide, and add each tool's instructions to the agent:

const sales = createCodeMode({
  id: 'sales_code',
  tools: { listRecentOrders, getCustomer },
  sandbox,
})

const inventory = createCodeMode({
  id: 'inventory_code',
  tools: { listProducts, getSupplier },
  sandbox,
})

const agent = new Agent({
  id: 'ops-assistant',
  name: 'ops-assistant',
  instructions: ['You are an ops assistant.', sales.instructions, inventory.instructions],
  model: 'openai/gpt-5.6-sol',
  tools: { sales_code: sales.tool, inventory_code: inventory.tool },
})

The generated code for sales_code can't call an inventory tool, and the reverse holds too. Use this for least-privilege scoping and to keep each tool's prompt surface small.

Remote sandboxes
Direct link to Remote sandboxes

By default, code mode uses a transport that writes the program to the host filesystem and runs node against it. That works for LocalSandbox, which shares the host, but not for remote sandboxes that run in their own micro-VM (such as E2B), where the host paths don't exist.

Remote sandboxes need a transport that writes the program into the sandbox filesystem. For E2B, pass the included E2BCodeModeTransport as the second argument to createCodeMode:

import { createCodeMode } from '@mastra/core/tools'
import { E2BSandbox, E2BCodeModeTransport } from '@mastra/e2b'

const { tool, instructions } = createCodeMode(
  { tools, sandbox: new E2BSandbox() },
  new E2BCodeModeTransport(),
)

In-process isolation
Direct link to In-process isolation

To get a secure boundary without spawning processes or running a remote sandbox, use IsolatedVmCodeModeTransport from @mastra/isolated-vm. It runs the program in an in-process V8 isolate, so no sandbox is required: the isolate has no filesystem, network, or process access, and its only capabilities are the external_* functions bridged back to your tools on the host.

import { createCodeMode } from '@mastra/core/tools'
import { IsolatedVmCodeModeTransport } from '@mastra/isolated-vm'

const { tool, instructions } = createCodeMode(
  { tools }, // no sandbox needed
  new IsolatedVmCodeModeTransport({ memoryLimitMb: 128 }),
)

isolated-vm is a native addon, and on Node.js 20 and later the host process must be started with the --no-node-snapshot flag. See the IsolatedVmCodeModeTransport reference for setup details.

When to use code modeDirect link to When to use code mode

How it worksDirect link to How it works

QuickstartDirect link to Quickstart

Scoping tools across multiple code toolsDirect link to Scoping tools across multiple code tools

Remote sandboxesDirect link to Remote sandboxes

In-process isolationDirect link to In-process isolation

RelatedDirect link to Related