Guardrails with Processors
The Theme Park agent can use tools, hold context, and plan a real park day. Once something like this is in front of real users, safety becomes a practical concern. Users won't always behave the way you expect, and some will actively try to manipulate or jailbreak the agent.
In this lesson, you'll add input-stage guardrails using two processors. PromptInjectionDetector intercepts prompt injection, jailbreak attempts, and system-override attempts before the model ever sees the message. ModerationProcessor screens incoming messages for categories such as hate and harassment at the input stage.
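As a rough sketch, attaching both processors to the agent might look like the following. This is a hedged example, not verbatim from the lesson: the exact constructor options, import paths, and the model passed to each processor are assumptions, so check the current Mastra docs before copying.

```typescript
// Sketch of attaching input processors to a Mastra agent.
// Option names and import paths are assumptions based on the Mastra docs.
import { Agent } from "@mastra/core/agent";
import {
  PromptInjectionDetector,
  ModerationProcessor,
} from "@mastra/core/processors";
import { openai } from "@ai-sdk/openai";

export const themeParkAgent = new Agent({
  name: "theme-park-agent",
  instructions: "You help visitors plan a day at the theme park.",
  model: openai("gpt-4o-mini"),
  inputProcessors: [
    // Runs before the LLM: flags injection / jailbreak / override attempts.
    new PromptInjectionDetector({ model: openai("gpt-4o-mini") }),
    // Screens the same input for categories like hate and harassment.
    new ModerationProcessor({ model: openai("gpt-4o-mini") }),
  ],
});
```

Processors run in array order, so putting the injection detector first means an obvious attack is rejected before the moderation check even runs.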
Both processors operate exclusively at the input stage and run before the LLM is invoked; a custom processor you write slots into that same pipeline. When a message is blocked at the input stage, the LLM is never called, and nothing from that request gets written to memory.
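The control flow above can be sketched in a few lines. This is not Mastra's implementation, just a self-contained illustration of the pipeline semantics: processors run in order before the model, and a block short-circuits both the model call and the memory write. The keyword heuristic is a toy stand-in for PromptInjectionDetector.

```typescript
// Minimal sketch of an input-processor pipeline (not Mastra internals).
type Message = { role: "user"; content: string };
type ProcessorResult = { blocked: boolean; reason?: string };
type InputProcessor = (msg: Message) => ProcessorResult;

// Toy keyword heuristic standing in for a real injection detector.
const injectionDetector: InputProcessor = (msg) =>
  /ignore (all|previous) instructions/i.test(msg.content)
    ? { blocked: true, reason: "prompt-injection" }
    : { blocked: false };

function handleMessage(
  msg: Message,
  processors: InputProcessor[],
  callModel: (msg: Message) => string,
  memory: Message[]
): string {
  for (const p of processors) {
    const result = p(msg);
    if (result.blocked) {
      // Short-circuit: the model is never invoked and nothing is persisted.
      return `Request blocked (${result.reason}).`;
    }
  }
  memory.push(msg); // Only clean requests reach memory and the model.
  return callModel(msg);
}
```

A blocked request returns early, which is why the model "never gets pulled off task": from the LLM's perspective the message simply never existed.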
"The model never gets pulled off task because the request does not make it through the input processor."
— Guil Hernandez
Mentioned in the lesson
Code:
Relevant Mastra docs:
Join the community
Ask questions:
- Discord — chat with other learners and the Mastra team
- Guil on LinkedIn — ask him questions directly
Follow Mastra: