Code Review That Actually Runs Your Code — Evan Marshall (ito.ai)
Evan Marshall, CTO of Ito, joins Agents Hour to argue that the bottleneck of code development has moved to QA — and the tools to automate it don't exist yet. Ito is a code review tool that runs your code. It intercepts every PR, spins up the impacted user flows in an isolated sandbox, and posts videos, screenshots, and run logs back to the GitHub timeline as proof. The team is fourteen people, mostly engineers. Evan has been a professional programmer for fifteen years and previously worked at Demox Labs and Rev. With model gains flattening after Opus 4.5, the harness around the model is where value gets created. Evan shares details of Ito's "carriage" — a workflow of swarmed agents inside deterministic boundaries, designed so the system benefits from agents getting smarter without compounding probabilistic errors. He explains why bigger organizations aren't hitting the 10x code velocity the X timeline promises, and what the validation layer needs to look like before they can.
Guests in this episode

Evan Marshall
ito.aiWatch on
Episode Transcript
Cold open: It's the harness
Meet Evan and Ito
Unit tests vs the integration layer
From scraping to QA
What's a harness?
What's a carriage?
Sandboxes and task decomposition
Claude Code, Opus, and Terminal Bench
Live demo: code reviews that run your code
The enterprise gap
Audience Q: Playwright integration
How to try Ito