Working Memory

While conversation history and semantic recall help agents remember conversations, working memory allows them to maintain persistent information about users across interactions.

Think of it as the agent's active thoughts or scratchpad – the key information they keep available about the user or task. It's similar to how a person would naturally remember someone's name, preferences, or important details during a conversation.

This is useful for maintaining ongoing state that's always relevant and should always be available to the agent.

Working memory can persist at two different scopes:

Thread-scoped (default): Memory is isolated per conversation thread
Resource-scoped: Memory persists across all conversation threads for the same user

Important: Switching between scopes means the agent won't see memory from the other scope - thread-scoped memory is completely separate from resource-scoped memory.

Quick StartDirect link to Quick Start

Here's a minimal example of setting up an agent with working memory:

import { Agent } from "@mastra/core/agent";
import { Memory } from "@mastra/memory";
import { openai } from "@ai-sdk/openai";

// Create agent with working memory enabled
const agent = new Agent({
  name: "PersonalAssistant",
  instructions: "You are a helpful personal assistant.",
  model: openai("gpt-4o"),
  memory: new Memory({
    options: {
      workingMemory: {
        enabled: true,
      },
    },
  }),
});

How it WorksDirect link to How it Works

Working memory is a block of Markdown text that the agent is able to update over time to store continuously relevant information:

Memory Persistence ScopesDirect link to Memory Persistence Scopes

Working memory can operate in two different scopes, allowing you to choose how memory persists across conversations:

Thread-Scoped Memory (Default)Direct link to Thread-Scoped Memory (Default)

By default, working memory is scoped to individual conversation threads. Each thread maintains its own isolated memory:

const memory = new Memory({
  storage,
  options: {
    workingMemory: {
      enabled: true,
      scope: 'thread', // Default - memory is isolated per thread
      template: `# User Profile
- **Name**:
- **Interests**:
- **Current Goal**:
`,
    },
  },
});

Use cases:

Different conversations about separate topics
Temporary or session-specific information
Workflows where each thread needs working memory but threads are ephemeral and not related to each other

Resource-Scoped MemoryDirect link to Resource-Scoped Memory

Resource-scoped memory persists across all conversation threads for the same user (resourceId), enabling persistent user memory:

const memory = new Memory({
  storage,
  options: {
    workingMemory: {
      enabled: true,
      scope: "resource", // Memory is isolated per thread
      template: `# User Profile
- **Name**:
- **Interests**:
- **Current Goal**:
`,
    },
  },
});

Use cases:

Different conversations about separate topics
Temporary or session-specific information
Workflows where each thread needs working memory but threads are ephemeral and not related to each other

Storage Adapter SupportDirect link to Storage Adapter Support

Resource-scoped working memory requires specific storage adapters that support the mastra_resources table:

✅ Supported Storage AdaptersDirect link to ✅ Supported Storage Adapters

LibSQL (@mastra/libsql)
PostgreSQL (@mastra/pg)
Upstash (@mastra/upstash)

Custom TemplatesDirect link to Custom Templates

Templates guide the agent on what information to track and update in working memory. While a default template is used if none is provided, you'll typically want to define a custom template tailored to your agent's specific use case to ensure it remembers the most relevant information.

Here's an example of a custom template. In this example the agent will store the users name, location, timezone, etc as soon as the user sends a message containing any of the info:

const memory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      template: `
# User Profile

## Personal Info

- Name:
- Location:
- Timezone:

## Preferences

- Communication Style: [e.g., Formal, Casual]
- Project Goal:
- Key Deadlines:
  - [Deadline 1]: [Date]
  - [Deadline 2]: [Date]

## Session State

- Last Task Discussed:
- Open Questions:
  - [Question 1]
  - [Question 2]
`,
    },
  },
});

Designing Effective TemplatesDirect link to Designing Effective Templates

A well-structured template keeps the information easy for the agent to parse and update. Treat the template as a short form that you want the assistant to keep up to date.

Short, focused labels. Avoid paragraphs or very long headings. Keep labels brief (for example ## Personal Info or - Name:) so updates are easy to read and less likely to be truncated.
Use consistent casing. Inconsistent capitalization (Timezone: vs timezone:) can cause messy updates. Stick to Title Case or lower case for headings and bullet labels.
Keep placeholder text simple. Use hints such as [e.g., Formal] or [Date] to help the LLM fill in the correct spots.
Abbreviate very long values. If you only need a short form, include guidance like - Name: [First name or nickname] or - Address (short): rather than the full legal text.
Mention update rules in instructions. You can instruct how and when to fill or clear parts of the template directly in the agent's instructions field.

Alternative Template StylesDirect link to Alternative Template Styles

Use a shorter single block if you only need a few items:

const basicMemory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      template: `User Facts:\n- Name:\n- Favorite Color:\n- Current Topic:`,
    },
  },
});

You can also store the key facts in a short paragraph format if you prefer a more narrative style:

const paragraphMemory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      template: `Important Details:\n\nKeep a short paragraph capturing the user's important facts (name, main goal, current task).`,
    },
  },
});

Structured Working MemoryDirect link to Structured Working Memory

Working memory can also be defined using a structured schema instead of a Markdown template. This allows you to specify the exact fields and types that should be tracked, using a Zod schema. When using a schema, the agent will see and update working memory as a JSON object matching your schema.

Important: You must specify either template or schema, but not both.

Example: Schema-Based Working MemoryDirect link to Example: Schema-Based Working Memory

import { z } from "zod";
import { Memory } from "@mastra/memory";

const userProfileSchema = z.object({
  name: z.string().optional(),
  location: z.string().optional(),
  timezone: z.string().optional(),
  preferences: z
    .object({
      communicationStyle: z.string().optional(),
      projectGoal: z.string().optional(),
      deadlines: z.array(z.string()).optional(),
    })
    .optional(),
});

const memory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      schema: userProfileSchema,
      // template: ... (do not set)
    },
  },
});

When a schema is provided, the agent receives the working memory as a JSON object. For example:

{
  "name": "Sam",
  "location": "Berlin",
  "timezone": "CET",
  "preferences": {
    "communicationStyle": "Formal",
    "projectGoal": "Launch MVP",
    "deadlines": ["2025-07-01"]
  }
}

Choosing Between Template and SchemaDirect link to Choosing Between Template and Schema

Use a template (Markdown) if you want the agent to maintain memory as a free-form text block, such as a user profile or scratchpad.
Use a schema if you need structured, type-safe data that can be validated and programmatically accessed as JSON.
Only one mode can be active at a time: setting both template and schema is not supported.

Example: Multi-step RetentionDirect link to Example: Multi-step Retention

Below is a simplified view of how the User Profile template updates across a short user conversation:

# User Profile

## Personal Info

- Name:
- Location:
- Timezone:

--- After user says "My name is **Sam** and I'm from **Berlin**" ---

# User Profile
- Name: Sam
- Location: Berlin
- Timezone:

--- After user adds "By the way I'm normally in **CET**" ---

# User Profile
- Name: Sam
- Location: Berlin
- Timezone: CET

The agent can now refer to Sam or Berlin in later responses without requesting the information again because it has been stored in working memory.

If your agent is not properly updating working memory when you expect it to, you can add system instructions on how and when to use this template in your agent's instructions setting.

Setting Initial Working MemoryDirect link to Setting Initial Working Memory

While agents typically update working memory through the updateWorkingMemory tool, you can also set initial working memory programmatically when creating or updating threads. This is useful for injecting user data (like their name, preferences, or other info) that you want available to the agent without passing it in every request.

Setting Working Memory via Thread MetadataDirect link to Setting Working Memory via Thread Metadata

When creating a thread, you can provide initial working memory through the metadata's workingMemory key:

src/app/medical-consultation.ts
// Create a thread with initial working memory
const thread = await memory.createThread({
  threadId: "thread-123",
  resourceId: "user-456",
  title: "Medical Consultation",
  metadata: {
    workingMemory: `# Patient Profile
- Name: John Doe
- Blood Type: O+
- Allergies: Penicillin
- Current Medications: None
- Medical History: Hypertension (controlled)
`,
  },
});

// The agent will now have access to this information in all messages
await agent.generate("What's my blood type?", {
  threadId: thread.id,
  resourceId: "user-456",
});
// Response: "Your blood type is O+."

Updating Working Memory ProgrammaticallyDirect link to Updating Working Memory Programmatically

You can also update an existing thread's working memory:

src/app/medical-consultation.ts
// Update thread metadata to add/modify working memory
await memory.updateThread({
  id: "thread-123",
  title: thread.title,
  metadata: {
    ...thread.metadata,
    workingMemory: `# Patient Profile
- Name: John Doe
- Blood Type: O+
- Allergies: Penicillin, Ibuprofen  // Updated
- Current Medications: Lisinopril 10mg daily  // Added
- Medical History: Hypertension (controlled)
`,
  },
});

Direct Memory UpdateDirect link to Direct Memory Update

Alternatively, use the updateWorkingMemory method directly:

src/app/medical-consultation.ts
await memory.updateWorkingMemory({
  threadId: "thread-123",
  resourceId: "user-456", // Required for resource-scoped memory
  workingMemory: "Updated memory content...",
});

ExamplesDirect link to Examples

Working memory with template
Working memory with schema
Per-resource working memory - Complete example showing resource-scoped memory persistence