Skip to content

strands-deepgram

strands-deepgram is a production-ready speech and audio processing tool powered by Deepgram’s AI platform with 30+ language support.

Installation

pip install strands-deepgram

Usage

from strands import Agent
from strands_deepgram import deepgram

agent = Agent(tools=[deepgram])

# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")

# Text-to-speech
agent("convert this text to speech: Hello world")

# Audio intelligence
agent("analyze sentiment in call.wav")

Key Features

Speech-to-Text: 30+ language support and speaker diarization
Text-to-Speech: Natural-sounding voices (Aura series)
Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
Speaker Diarization: Identify and separate different speakers
Multi-format Support: WAV, MP3, M4A, FLAC, and more
Real-time Processing: Streaming capabilities for live audio

Configuration

DEEPGRAM_API_KEY=your_deepgram_api_key    # Required
DEEPGRAM_DEFAULT_MODEL=nova-3             # Optional
DEEPGRAM_DEFAULT_LANGUAGE=en              # Optional

Get your API key at: console.deepgram.com

Resources