diff --git a/fern/providers/voice/sesame.mdx b/fern/providers/voice/sesame.mdx index 1752c528..28349c4e 100644 --- a/fern/providers/voice/sesame.mdx +++ b/fern/providers/voice/sesame.mdx @@ -6,13 +6,12 @@ slug: providers/voice/sesame **What is Sesame CSM-1B?** -Sesame CSM-1B is an open source text-to-speech (TTS) model that Vapi hosts for seamless integration into your voice applications. Currently in beta, this model delivers natural-sounding speech synthesis with a single default voice option. +Sesame CSM-1B is an open source text-to-speech (TTS) model that Vapi hosts for seamless integration into your voice applications. This model delivers natural-sounding speech synthesis with a default voice option and voice cloning capabilities. **Key Features:** - **Vapi-Hosted Solution**: Access this open source model directly through Vapi without managing your own infrastructure -- **Single Default Voice**: Currently offers one voice option optimized for clarity and naturalness -- **Beta Release**: Early access to this emerging TTS technology +- **Voice Options**: Offers a default voice and voice cloning capabilities **Integration Benefits:** @@ -27,6 +26,16 @@ Sesame CSM-1B is an open source text-to-speech (TTS) model that Vapi hosts for s - Interactive voice applications - Prototyping voice-driven experiences +**Voice Cloning:** + +![Sesame Voice Cloning](/static/images/voice-tab/sesame/cloning.png) + +Sesame supports voice cloning. To clone a voice: +1. Navigate to the additional configuration tab (below the voice tab) on the assistants page +2. Upload a WAV file containing your voice sample +3. Provide the transcript of the audio file +4. Name your custom voice + **Current Limitations:** -As this is a beta release, the model currently offers limited customization options with only one default voice available. Additional features and voice options may be introduced in future updates. \ No newline at end of file +The model currently has some limitations. Additional features may be introduced in future updates. \ No newline at end of file diff --git a/fern/static/images/voice-tab/sesame/cloning.png b/fern/static/images/voice-tab/sesame/cloning.png new file mode 100644 index 00000000..5bf62a4d Binary files /dev/null and b/fern/static/images/voice-tab/sesame/cloning.png differ