strands-deepgram
strands-deepgram is a production-ready speech and audio processing tool for Strands agents, powered by Deepgram’s AI platform, with support for 30+ languages.
Installation
pip install strands-deepgram

from strands import Agent
from strands_deepgram import deepgram

agent = Agent(tools=[deepgram])

# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")

# Text-to-speech
agent("convert this text to speech: Hello world")

# Audio intelligence
agent("analyze sentiment in call.wav")

Key Features
- Speech-to-Text: 30+ language support and speaker diarization
- Text-to-Speech: Natural-sounding voices (Aura series)
- Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
- Speaker Diarization: Identify and separate different speakers
- Multi-format Support: WAV, MP3, M4A, FLAC, and more
- Real-time Processing: Streaming capabilities for live audio
Configuration
DEEPGRAM_API_KEY=your_deepgram_api_key  # Required
DEEPGRAM_DEFAULT_MODEL=nova-3  # Optional
DEEPGRAM_DEFAULT_LANGUAGE=en  # Optional

Get your API key at: console.deepgram.com
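
As a minimal sketch of how an application might check these settings before building the agent, the snippet below reads the three variables from the environment and fails early if the required key is missing. The `DeepgramConfig` dataclass and `load_deepgram_config` helper are hypothetical names used only for illustration and are not part of strands-deepgram. The fallback values `nova-3` and `en` mirror the sample values above and are an assumption, not confirmed defaults of the tool.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class DeepgramConfig:
    """Hypothetical container for the settings listed above."""
    api_key: str
    default_model: str
    default_language: str


def load_deepgram_config(env: Optional[Mapping[str, str]] = None) -> DeepgramConfig:
    """Read Deepgram settings from a mapping (defaults to os.environ).

    Raises RuntimeError when the required DEEPGRAM_API_KEY is absent or blank.
    """
    source = os.environ if env is None else env

    api_key = source.get("DEEPGRAM_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("DEEPGRAM_API_KEY is required but not set")

    return DeepgramConfig(
        api_key=api_key,
        # Assumed fallbacks, copied from the sample values in this section.
        default_model=source.get("DEEPGRAM_DEFAULT_MODEL", "nova-3"),
        default_language=source.get("DEEPGRAM_DEFAULT_LANGUAGE", "en"),
    )
```

Calling `load_deepgram_config()` with no arguments reads from the process environment. Passing a plain dictionary instead makes the check easy to exercise in tests without touching real credentials.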