Welcome to Deepdub
Deepdub provides a powerful Text-to-Speech API for generating natural, expressive speech with voice cloning, accent control, and real-time streaming. Whether you’re building voiceover pipelines, conversational agents, or content localization workflows, Deepdub delivers studio-quality audio at scale.
Key capabilities
Text-to-Speech
Generate speech from text using state-of-the-art models with fine-grained control over tempo, variance, and duration.
Voice Cloning
Clone any voice from a short audio sample. Upload voice prompts or pass a base64-encoded audio reference for instant cloning.
Accent Control
Blend accents between locales with precise ratio control. For example, generate an American English speaker with a French accent, or any other combination.
Real-time Streaming
Stream audio in real time over HTTP or WebSocket connections for low-latency applications.
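The voice cloning capability above mentions passing a base64-encoded audio reference. The following is a minimal sketch of that encoding step using only the Python standard library. The payload field names `text` and `voice_reference` are hypothetical placeholders, not confirmed Deepdub parameters; check the REST API Reference for the real ones.

```python
import base64


def encode_voice_reference(audio_bytes: bytes) -> str:
    """Return the audio sample as an ASCII base64 string that can be placed in a JSON body."""
    if not audio_bytes:
        raise ValueError("audio sample is empty")
    return base64.b64encode(audio_bytes).decode("ascii")


def build_cloning_payload(text: str, audio_bytes: bytes) -> dict:
    # ASSUMPTION: "text" and "voice_reference" are illustrative field names only.
    return {"text": text, "voice_reference": encode_voice_reference(audio_bytes)}
```

In practice the bytes would come from reading a short audio file in binary mode, for example `open("sample.wav", "rb").read()`.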
Try it now
Use the free trial API key to generate speech instantly, with no sign-up required. Examples are provided for Python and JavaScript.
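As a rough illustration of what a first request might look like, the sketch below builds (but does not send) an HTTP request against the REST base URL listed under API access. The `/tts` path, the `x-api-key` header name, and the `text` and `locale` body fields are assumptions made for this example; the Quickstart and REST API Reference are the authoritative sources for the actual path, headers, and parameters.

```python
import json
import urllib.request

# Base URL taken from the API access table in this document.
BASE_URL = "https://restapi.deepdub.ai/api/v1"


def build_tts_request(api_key: str, text: str, locale: str = "en-US") -> urllib.request.Request:
    """Build a POST request for speech generation without sending it."""
    # ASSUMPTION: the "/tts" path, the "x-api-key" header, and the body
    # field names are hypothetical and only illustrate the request shape.
    payload = {"text": text, "locale": locale}
    return urllib.request.Request(
        url=f"{BASE_URL}/tts",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-api-key": api_key},
        method="POST",
    )
```

Sending the request would then be a matter of passing it to `urllib.request.urlopen` and writing the response bytes to a file, once the real endpoint and key are in place.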
API access
Deepdub offers two integration methods:

| Method | Endpoint | Use case |
|---|---|---|
| REST API | https://restapi.deepdub.ai/api/v1 | Synchronous audio generation, voice management, retroactive generation |
| WebSocket API | wss://wsapi.deepdub.ai | Real-time streaming with chunked audio delivery |
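Chunked delivery means the client receives audio as a sequence of pieces rather than one response body. The sketch below shows one transport-agnostic way to reassemble such chunks in Python. It assumes each chunk arrives as raw bytes; `fake_stream` is a hypothetical stand-in for a real WebSocket connection, and the actual message format is defined in the WebSocket API documentation.

```python
import asyncio
from typing import AsyncIterator


async def collect_audio(chunks: AsyncIterator[bytes]) -> bytes:
    """Concatenate audio chunks in arrival order into a single bytes object."""
    buffer = bytearray()
    async for chunk in chunks:
        buffer.extend(chunk)
    return bytes(buffer)


async def fake_stream() -> AsyncIterator[bytes]:
    # ASSUMPTION: stand-in for a real connection; yields raw byte chunks.
    for part in (b"abc", b"def", b"gh"):
        await asyncio.sleep(0)  # yield control, as a network read would
        yield part
```

A low-latency player would normally hand each chunk to the audio device as it arrives instead of buffering everything; collecting into one buffer is the simpler choice when the goal is to save a file.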
Quick links
Quickstart
Get up and running with your first TTS generation in minutes.
Voice Presets
Browse ready-to-use voice presets across 6 languages.
Python SDK
Install the Python SDK and start generating speech.
JavaScript SDK
Install the Node.js SDK with real-time streaming support.
REST API Reference
Explore the full REST API with an interactive playground.
WebSocket API
Stream audio in real time with chunked delivery.
