AI Agents Hour — Feb 2026 till Dec 2025
Watch and listen to past episodes, breaking AI news, guests from the industry, and technical deep dives on building AI agents.
February 24, 2026
#82
Sazabi: AI-Native Observability for Fast-Moving Teams (with Sherwood Callaway)
In this episode, Shane and Abhi sit down with Sherwood Callaway, founder of Sazabi, an AI-native observability platform designed for engineering teams that move fast. Sherwood shares his journey from building infrastructure and observability teams at Brex to realizing that modern development tools are moving at light speed, while observability tooling hasn't kept pace. While AI agents can ship thousands of lines of code per day, teams are still debugging production with the same tools they've been using for years: Datadog, Sentry, manual dashboards, and manual incident triage. Sazabi takes a radically different approach to observability centered on three core principles: 1. Less is More — Debugging an incident is as simple as asking a question. "Why is production down?" The best UI for observability is chat. 2. Logs Are All You Need — The "three pillars of observability" (logs, metrics, traces) is outdated dogma. With AI, you can accomplish everything using just logs. Logs are events, metrics are aggregated events, and traces are collections of start/end events. Logs can do it all. 3. Monitoring as We Know It is Dead — Sazabi replaces static monitors with agentic anomaly detection. Think of it as a team of staff engineers constantly watching your app for issues, investigating problems, and only escalating what matters. In this conversation, we dive into the gap between modern development and modern observability, and why the idea that “logs are all you need” is both controversial and, in Sherwood's view, correct. We also explore how Sazabi uses AI agents for root cause analysis (RCA), the philosophy behind simplifying observability for all engineers, and the company’s current status.
February 24, 2026
#65
How to Orchestrate Coding Agents with Conductor, with Charlie Holtz
Shane and Abhi welcome Charlie Holtz from Conductor to AI Agents Hour. Charlie shares how frustration with managing multiple Claude Code instances led to building Conductor. They discuss Conductor's July 2025 launch as the first agent orchestration Mac app, early design choices, and its impact on the market.
February 20, 2026
#64
AI NEWS - Something Big is Happening: Gemini 3.1 Pro, GPT-5.3-Spark, and Anthropic $30B fundraise
It's time for another AI News roundup with Shane and Abhi! This week was absolutely massive. Matt Shumer's viral article about AI automation, which describes his own job being automated in real time, has reached 84 million views. Anthropic raised $30 billion at a $380B valuation (one of the largest private raises in tech history). Claude Sonnet 4.6 launched with a 1M token context window. And the Chinese model tsunami is real: Qwen 3.5, GLM 5.0, MiniMax M2.5 (nearly Opus-level at 1/8 the cost), and DeepSeek v4 rumors.
February 12, 2026
#63
Observational Memory: The Human-Inspired Memory System for AI Agents, with Tyler Barnes
Tyler Barnes, founding engineer at Mastra, introduces Observational Memory. It is a new memory system for AI agents that achieves state-of-the-art results on LongMemEval with a completely stable context window. Unlike semantic recall (which uses RAG and invalidates prompt caching), Observational Memory compresses conversations into dense observations while maintaining a stable, fully cacheable context. The result: 94.87% accuracy on LongMemEval with GPT-5 mini. This is the highest score recorded by any memory system to date. In this conversation, Tyler explains how the system works, why it outperforms raw context, and how you can integrate it into your agents in under 20 minutes. We also dive into the research, the benchmarks, and what's next for Observational Memory.
February 10, 2026
#62
AI News: Model Wars - Opus 4.6 vs GPT-5.3-Codex + Seedance 2.0 Redefines AI Video
Shane and Abhi cover top AI stories. This week was absolutely massive! Anthropic aired Super Bowl ads mocking OpenAI's decision to put ads in ChatGPT, Opus 4.6, and GPT-5.3-Codex launched within 15 minutes of each other, and then ClawHub dropped a bombshell: 11.9% of the entire marketplace is malware. We cover everything: Anthropic's competitive jabs, the model war benchmarks, Claude's 1M token context, OpenAI's Frontier platform, the security crisis that's reshaping how people think about agent marketplaces, Kimi K2.5's domination on OpenRouter, ElevenLabs' $500M raise at $11B valuation, and the explosion of AI video generation tools. Plus: Perplexity's Model Council, Roblox 4D generation, Mistral's Voxtral Transcribe 2, and why Swyx finally admits evals actually help.
February 5, 2026
#61
Running 100 AI Agents in Parallel: Superset Cofounder Kiet Ho
Shane and Abhi welcome Kiet Ho, cofounder of Superset, to discuss how Superset evolved from simple WorkTree management into a full-featured tool with file editing, automation, cloud support, and multi-agent orchestration.
February 4, 2026
#60
AI News: OpenClaw Drama, Google DeepMind's Project Genie, Kimi K2.5
Shane and Abhi cover this week's top AI developments. The OpenClaw agent goes viral amid growing security concerns over its shifting identity. Moonshot AI's Kimi K2.5 ties for first place on Design Arena. Google DeepMind unveils Project Genie, advancing world model technology. Anthropic previews a new model and explores AI coding with Claude Cowork plugins. Plus, updates on developer tools, humanoid robots, and emerging AI models.
January 27, 2026
#59
Mastra 1.0 Recap, Today I learned, ClawdBot, and the news!
Today we discuss the Mastra 1.0 launch, we talk about the things we are learning, we discuss ClawdBot (renamed to MoltBot and now OpenClaw). We also discuss agent skills, new models, and how agents are reading more docs pages than humans.
January 20, 2026
#58
Announcing Mastra 1.0, Kyle from ElectricSQL, Security Corner & News
Today we announce Mastra is officially 1.0! We talk with Kyle from ElectricSQL (and formerly Gatsby) about Durable Streams. Allie joins us to discuss the code review security problem that can be caused by generated AI code. Finally, we cover all the AI news from the last week including ads in ChatGPT, the Langfuse/Clickhouse acquisition, and the OpenAI / Cerebres partnership.
January 14, 2026
#57
Emerging Agent Primitives, Anthropic ships, Google+Apple
Today we discuss a rising star, we talk about the emerging agent primitives, and we cover downstream effects of a world where agents are now the primary users of a lot of products/tools. We cover Anthropic news, the Google+Apple partnership, model companies going public, AI in healthcare, and much more.
January 6, 2026
#56
Ralph Wiggum, Opencode, AI News, can everyone be a developer?
Who is Ralph Wiggum and why are developers talking about him? We also discuss the rise of opencode and if everyone can be a developer now. And of course, we cover all the latest in AI news.
December 29, 2025
#55
Rebuilding Git in a weekend, 2025 AI Recap & 2026 AI Predictions
Today we talk about how Abhi tried to rebuild Git over the holiday weekend. We also review everything that happened in the world of AI and agents in 2025. Finally, we give our 2026 predictions for what we think will happen next year.