Cloudflare

The CloudflareVoice class in Mastra provides text-to-speech capabilities using Cloudflare Workers AI. This provider specializes in efficient, low-latency speech synthesis suitable for edge computing environments.

Usage Example
Direct link to Usage Example

import { CloudflareVoice } from '@mastra/voice-cloudflare'

// Initialize with configuration
const voice = new CloudflareVoice({
  speechModel: {
    name: '@cf/meta/m2m100-1.2b',
    apiKey: 'your-cloudflare-api-token',
    accountId: 'your-cloudflare-account-id',
  },
  speaker: 'en-US-1', // Default voice
})

// Convert text to speech
const audioStream = await voice.speak('Hello, how can I help you?', {
  speaker: 'en-US-2', // Override default voice
})

// Get available voices
const speakers = await voice.getSpeakers()
console.log(speakers)

Configuration
Direct link to Configuration

Constructor Options
Direct link to Constructor Options

speechModel?:

CloudflareSpeechConfig

Configuration for text-to-speech synthesis.

CloudflareSpeechConfig

name?:

string

Model name to use for TTS.

apiKey?:

string

Cloudflare API token with Workers AI access. Falls back to CLOUDFLARE_API_TOKEN environment variable.

accountId?:

string

Cloudflare account ID. Falls back to CLOUDFLARE_ACCOUNT_ID environment variable.

speaker?:

string

= 'en-US-1'

Default voice ID for speech synthesis.

Methods
Direct link to Methods

speak()
Direct link to speak()

Converts text to speech using Cloudflare's text-to-speech service.

input:

string | NodeJS.ReadableStream

Text or text stream to convert to speech.

options?:

Options

Configuration options.

Options

speaker?:

string

Voice ID to use for speech synthesis.

format?:

string

Output audio format.

Returns: Promise<NodeJS.ReadableStream>

getSpeakers()
Direct link to getSpeakers()

Returns an array of available voice options, where each node contains:

voiceId:

string

Unique identifier for the voice (e.g., 'en-US-1')

language:

string

Language code of the voice (e.g., 'en-US')

Notes
Direct link to Notes

API tokens can be provided via constructor options or environment variables (CLOUDFLARE_API_TOKEN and CLOUDFLARE_ACCOUNT_ID)
Cloudflare Workers AI is optimized for edge computing with low latency
This provider only supports text-to-speech (TTS) functionality, not speech-to-text (STT)
The service integrates well with other Cloudflare Workers products
For production use, ensure your Cloudflare account has the appropriate Workers AI subscription
Voice options are more limited compared to some other providers, but performance at the edge is excellent

If you need speech-to-text capabilities in addition to text-to-speech, consider using one of these providers:

OpenAI - Provides both TTS and STT
Google - Provides both TTS and STT
Azure - Provides both TTS and STT

Usage ExampleDirect link to Usage Example

ConfigurationDirect link to Configuration

Constructor OptionsDirect link to Constructor Options

speechModel?:

name?:

apiKey?:

accountId?:

speaker?:

MethodsDirect link to Methods

speak()Direct link to speak()

input:

options?:

speaker?:

format?:

getSpeakers()Direct link to getSpeakers()

voiceId:

language:

NotesDirect link to Notes

Related ProvidersDirect link to Related Providers

Usage Example
Direct link to Usage Example

Configuration
Direct link to Configuration

Constructor Options
Direct link to Constructor Options

Methods
Direct link to Methods

speak()
Direct link to speak()

getSpeakers()
Direct link to getSpeakers()

Notes
Direct link to Notes

Related Providers
Direct link to Related Providers