Skip to content

Commit 4faf96f

Browse files
authored
VAP-9999 Sesame Voice Cloning (#322)
* sesame cloning docs * thanks!
1 parent 7d9c849 commit 4faf96f

File tree

2 files changed

+13
-4
lines changed

2 files changed

+13
-4
lines changed

fern/providers/voice/sesame.mdx

Lines changed: 13 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,13 +6,12 @@ slug: providers/voice/sesame
66

77
**What is Sesame CSM-1B?**
88

9-
Sesame CSM-1B is an open source text-to-speech (TTS) model that Vapi hosts for seamless integration into your voice applications. Currently in beta, this model delivers natural-sounding speech synthesis with a single default voice option.
9+
Sesame CSM-1B is an open source text-to-speech (TTS) model that Vapi hosts for seamless integration into your voice applications. This model delivers natural-sounding speech synthesis with a default voice option and voice cloning capabilities.
1010

1111
**Key Features:**
1212

1313
- **Vapi-Hosted Solution**: Access this open source model directly through Vapi without managing your own infrastructure
14-
- **Single Default Voice**: Currently offers one voice option optimized for clarity and naturalness
15-
- **Beta Release**: Early access to this emerging TTS technology
14+
- **Voice Options**: Offers a default voice and voice cloning capabilities
1615

1716
**Integration Benefits:**
1817

@@ -27,6 +26,16 @@ Sesame CSM-1B is an open source text-to-speech (TTS) model that Vapi hosts for s
2726
- Interactive voice applications
2827
- Prototyping voice-driven experiences
2928

29+
**Voice Cloning:**
30+
31+
![Sesame Voice Cloning](/static/images/voice-tab/sesame/cloning.png)
32+
33+
Sesame supports voice cloning. To clone a voice:
34+
1. Navigate to the additional configuration tab (below the voice tab) on the assistants page
35+
2. Upload a WAV file containing your voice sample
36+
3. Provide the transcript of the audio file
37+
4. Name your custom voice
38+
3039
**Current Limitations:**
3140

32-
As this is a beta release, the model currently offers limited customization options with only one default voice available. Additional features and voice options may be introduced in future updates.
41+
The model currently has some limitations. Additional features may be introduced in future updates.
81.2 KB
Loading

0 commit comments

Comments
 (0)