
AI Beats Laboratory: A Multi-Agent Music Generation System

The AI Beats Laboratory is an interactive web application that generates musical beats and melodies using AI agents. Here's how it works:

Agents

The system uses two specialized Mastra agents:

  • A music reference agent that analyzes musical styles and references
  • A music generation agent that creates drum patterns and melodies

Here's the reference agent definition:

export const musicReferenceAgent = new Agent({
  name: "music-reference-agent",
  instructions: `
    You are given a style of music, an artist or song as a reference point. 
    First think about what keys and what drum patterns fit this reference point.
    Based on this knowledge, generate a drum pattern and a minimal melody that fits the style.
    Pick a key based on the style of the music. All notes should be in this key.
    `,
  model: {
    provider: "ANTHROPIC",
    name: "claude-3-5-sonnet-20241022",
    toolChoice: "auto",
  },
});

Here's the music generation agent definition:

export const musicAgent = new Agent({
  name: "music-agent",
  instructions: `
    
    For the pianoSequence:
    - Create wonderful melodies
    - Available notes:
      * High register: ['C5', 'B4', 'A4', 'G4']
      * Middle register: ['F4', 'E4', 'D4', 'C4']
      * Low register: ['B3', 'A3', 'G3']
    - Each note should have an array of step numbers (0-15)
    For the drumSequence:
    - Available sounds:
      * Core rhythm: ['Kick', 'Snare', 'HiHat']
      * Accents: ['Clap', 'OpenHat', 'Crash']
      * Percussion: ['Tom', 'Ride', 'Shaker', 'Cowbell']
    - Each sound should have an array of step numbers (0-15)
    Response format must be:
    {
      "pianoSequence": {
        "C5": [numbers],
        "B4": [numbers],
        // ... other piano notes
      },
      "drumSequence": {
        "Kick": [numbers],
        "Snare": [numbers],
        // ... other drum sounds
      }
    }
`,
  model: anthropic("claude-3-5-sonnet-20241022"),
});
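
The fetch calls shown later hit /api/agents/musicReferenceAgent/generate and /api/agents/musicAgent/generate, which assumes both agents are registered on the Mastra instance under those keys. A minimal sketch of that registration (the file paths are assumptions, not the repo's actual layout):

import { Mastra } from "@mastra/core";
import { musicAgent } from "./agents/music-agent";
import { musicReferenceAgent } from "./agents/music-reference-agent";

// Registering the agents exposes them at /api/agents/<key>/generate
export const mastra = new Mastra({
  agents: { musicAgent, musicReferenceAgent },
});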

It turns out LLMs are not very good at music, so most of the time was spent iterating on the system prompt. Anthropic’s Claude 3.5 Sonnet performed better than OpenAI’s GPT-4o.

User Interface Components

The main interface is built around the Sequencer component, which provides:

  • A 16-step grid for both piano notes and drum sounds
  • Interactive controls for playing/stopping sequences
  • Tempo controls
  • Export/share functionality
  • AI generation controls

The sequencer layout is defined by these constants:

const STEPS = 16;
const PIANO_NOTES = [
  "C5",
  "B4",
  "A4",
  "G4",
  "F4",
  "E4",
  "D4",
  "C4",
  "B3",
  "A3",
  "G3",
];
const DRUM_SOUNDS = [
  "Kick",
  "Snare",
  "HiHat",
  "Clap",
  "OpenHat",
  "Tom",
  "Crash",
  "Ride",
  "Shaker",
  "Cowbell",
];
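
The grid itself can be represented as a simple map from each row (a piano note or drum sound) to its list of active steps, which is also the shape the agents are asked to return. Here's a sketch of how toggling a cell might update that state; the function name and types are assumptions for illustration, not the component's actual code:

// A sequence maps each row label (e.g. "C5" or "Kick") to its active steps (0-15)
type Sequence = Record<string, number[]>;

// Toggle one cell in the grid: add the step if absent, remove it if present
const toggleStep = (sequence: Sequence, row: string, step: number): Sequence => {
  const current = sequence[row] ?? [];
  const active = current.includes(step)
    ? current.filter((s) => s !== step)
    : [...current, step].sort((a, b) => a - b);
  return { ...sequence, [row]: active };
};

// Example: activate step 4 on the Kick row
// const next = toggleStep(drumSequence, "Kick", 4);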

Audio System

The application uses the Web Audio API for sound generation. The audio system is initialized with:

// Create a single audio context for the entire application
let audioContext: AudioContext | null = null;

export const getAudioContext = () => {
  if (!audioContext) {
    audioContext = new AudioContext();
    // Resume audio context on creation to handle auto-play restrictions
    audioContext.resume();
  }
  return audioContext;
};

Piano notes are mapped to frequencies:

const NOTE_FREQUENCIES: { [key: string]: number } = {
  C5: 523.25,
  B4: 493.88,
  A4: 440.0,
  G4: 392.0,
  F4: 349.23,
  E4: 329.63,
  D4: 293.66,
  C4: 261.63,
  B3: 246.94,
  A3: 220.0,
  G3: 196.0,
};
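
With the shared AudioContext and the frequency table, a piano note can be synthesized with a plain OscillatorNode and a short gain envelope. A minimal sketch, assuming a simple triangle-wave voice (the app's actual synthesis details may differ):

// Play a single piano note for a short duration using the shared context
export const playNote = (note: string, duration = 0.3) => {
  const ctx = getAudioContext();
  const frequency = NOTE_FREQUENCIES[note];
  if (!frequency) return;

  const oscillator = ctx.createOscillator();
  const gain = ctx.createGain();

  oscillator.type = "triangle";
  oscillator.frequency.value = frequency;

  // Quick attack, then exponential decay to avoid clicks
  gain.gain.setValueAtTime(0.4, ctx.currentTime);
  gain.gain.exponentialRampToValueAtTime(0.001, ctx.currentTime + duration);

  oscillator.connect(gain);
  gain.connect(ctx.destination);

  oscillator.start();
  oscillator.stop(ctx.currentTime + duration);
};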

Generation Flow

When a user requests a new beat:

  • The user enters a prompt describing their desired musical style
  • The music reference agent analyzes the prompt and provides musical context
  • The music generation agent creates patterns based on this context
  • The patterns are rendered in the sequencer grid

The generation process is handled in handleGenerateSequence:

const handleGenerateSequence = async () => {
  if (!prompt) return;
  setIsGenerating(true);

  try {
    const ctx = getAudioContext();
    ctx.resume();

    // First, get musical analysis from reference agent
    const refAgent =
      getMastraFetchUrl() + "/api/agents/musicReferenceAgent/generate";
    const response = await window.fetch(refAgent, {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({
        messages: [`Please analyze the users request "${prompt}"`],
      }),
    });

    const d = await response.json();
    setReference(d.text);

    // Then, generate the actual beat pattern using music agent
    const uri = getMastraFetchUrl() + "/api/agents/musicAgent/generate";
    const result = await window.fetch(uri, {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({
        messages: [
          `Please make me a beat based on this information: ${d.text}`,
        ],
        output: {
          // ... JSON schema defining required notes and drum sounds
          // Each property (C5, B4, Kick, Snare, etc.) expects an array of integers
          // representing the steps where that note/sound should play
        },
      }),
    });

    const data = await result.json();

    // Map the response data to piano and drum sequences
    const pianoSequence = {
      C5: data.object.C5 || [],
      B4: data.object.B4 || [],
      // ... additional piano notes C5 through G3
    };

    const drumSequence = {
      Kick: data.object.Kick || [],
      Snare: data.object.Snare || [],
      // ... additional drum sounds
    };

    setDrumSequence(drumSequence);
    setPianoSequence(pianoSequence);
    stopSequence();
  } catch (error) {
    console.error("Error generating sequence:", error);
  } finally {
    setIsGenerating(false);
  }
};
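
The output field elided above is a structured-output schema: each piano note and drum sound becomes a property whose value is an array of step indices, as the comments describe. A hedged sketch of what such a schema could look like (abbreviated and illustrative only; the actual schema in the repo may differ):

// Illustrative structured-output schema (abbreviated)
const outputSchema = {
  type: "object",
  properties: {
    C5: { type: "array", items: { type: "integer", minimum: 0, maximum: 15 } },
    B4: { type: "array", items: { type: "integer", minimum: 0, maximum: 15 } },
    // ... one entry per remaining piano note (A4 through G3)
    Kick: { type: "array", items: { type: "integer", minimum: 0, maximum: 15 } },
    Snare: { type: "array", items: { type: "integer", minimum: 0, maximum: 15 } },
    // ... one entry per remaining drum sound (HiHat, Clap, etc.)
  },
};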

Sharing and Export

The system supports:

  • Sharing beats via URL encoding
  • Exporting to MIDI format
  • Generating variations of existing patterns
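
Sharing via URL encoding can be as simple as serializing both sequences to JSON and base64-encoding them into a query parameter. A sketch of that approach (the parameter name and helpers are assumptions, not the app's exact implementation):

type SequenceMap = Record<string, number[]>;

// Encode the current pattern into a shareable URL (the "beat" parameter is illustrative)
const encodeShareUrl = (pianoSequence: SequenceMap, drumSequence: SequenceMap): string => {
  const payload = JSON.stringify({ pianoSequence, drumSequence });
  return `${window.location.origin}?beat=${encodeURIComponent(btoa(payload))}`;
};

// Decode it back when the page loads with a ?beat= parameter
const decodeShareUrl = (): { pianoSequence: SequenceMap; drumSequence: SequenceMap } | null => {
  const encoded = new URLSearchParams(window.location.search).get("beat");
  return encoded ? JSON.parse(atob(decodeURIComponent(encoded))) : null;
};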

By the way, you can find all the code on GitHub and try the demo yourself here.
