Deepgram

The Deepgram voice implementation in Mastra provides text-to-speech (TTS) and speech-to-text (STT) capabilities using Deepgram's API. It supports multiple voice models and languages, with configurable options for both speech synthesis and transcription.

Usage example
Direct link to Usage example

import { DeepgramVoice } from '@mastra/voice-deepgram'

// Initialize with default configuration (uses DEEPGRAM_API_KEY environment variable)
const voice = new DeepgramVoice()

// Initialize with custom configuration
const voice = new DeepgramVoice({
  speechModel: {
    name: 'aura',
    apiKey: 'your-api-key',
  },
  listeningModel: {
    name: 'nova-2',
    apiKey: 'your-api-key',
  },
  speaker: 'asteria-en',
})

// Text-to-Speech
const audioStream = await voice.speak('Hello, world!')

// Speech-to-Text
const transcript = await voice.listen(audioStream)

Constructor parameters
Direct link to Constructor parameters

speechModel?:

DeepgramVoiceConfig

= { name: 'aura' }

Configuration for text-to-speech functionality.

DeepgramVoiceConfig

name?:

DeepgramModel

The Deepgram model to use

apiKey?:

string

Deepgram API key. Falls back to DEEPGRAM_API_KEY environment variable

properties?:

Record<string, any>

Additional properties to pass to the Deepgram API

language?:

string

Language code for the model

listeningModel?:

DeepgramVoiceConfig

= { name: 'nova' }

Configuration for speech-to-text functionality.

DeepgramVoiceConfig

name?:

DeepgramModel

The Deepgram model to use

apiKey?:

string

Deepgram API key. Falls back to DEEPGRAM_API_KEY environment variable

properties?:

Record<string, any>

Additional properties to pass to the Deepgram API

language?:

string

Language code for the model

speaker?:

DeepgramVoiceId

= 'asteria-en'

Default voice to use for text-to-speech

Methods
Direct link to Methods

`speak()`
Direct link to speak

Converts text to speech using the configured speech model and voice.

input:

string | NodeJS.ReadableStream

Text to convert to speech. If a stream is provided, it will be converted to text first.

options?:

object

Additional options for speech synthesis

object

speaker?:

string

Override the default speaker for this request

Returns: Promise<NodeJS.ReadableStream>

`listen()`
Direct link to listen

Converts speech to text using the configured listening model.

audioStream:

NodeJS.ReadableStream

Audio stream to transcribe

options?:

object

Additional options to pass to the Deepgram API

Returns: Promise<string>

`getSpeakers()`
Direct link to getspeakers

Returns a list of available voice options.

voiceId:

string

Unique identifier for the voice

Usage exampleDirect link to Usage example

Constructor parametersDirect link to Constructor parameters

speechModel?:

name?:

apiKey?:

properties?:

language?:

listeningModel?:

name?:

apiKey?:

properties?:

language?:

speaker?:

MethodsDirect link to Methods

speak()Direct link to speak

input:

options?:

speaker?:

listen()Direct link to listen

audioStream:

options?:

getSpeakers()Direct link to getspeakers

voiceId:

Usage example
Direct link to Usage example

Constructor parameters
Direct link to Constructor parameters

Methods
Direct link to Methods

`speak()`
Direct link to speak

`listen()`
Direct link to listen

`getSpeakers()`
Direct link to getspeakers