Expand Python API capabilities #197

eginhard · 2024-12-06T11:10:16Z

This PR aligns the Python API more closely with what is available via the CLI. Previously pretrained TTS models could only be used with their default vocoder.

For example, this uses vocoder_models/en/ljspeech/hifigan_v2

from TTS.api import TTS

tts = TTS("tts_models/en/ljspeech/fast_pitch")
_ = tts.tts("hello")

Now you can also pass a different pretrained vocoder name (fixes coqui-ai#3558):

tts = TTS(
    "tts_models/en/ljspeech/fast_pitch",
    vocoder_name="vocoder_models/en/ljspeech/multiband-melgan"
)
_ = tts.tts("hello")

Combining a pretrained TTS model with a local vocoder and vice-versa or passing a custom speaker encoder is also possible now.

I then refactored the CLI to use the Python API internally, so that everything goes through the same pipeline for consistency.

eginhard added 5 commits December 5, 2024 21:19

docs: use .to("cuda") instead of deprecated gpu=True

8c381e3

refactor(api): require keyword arguments except for model_name

5cfb4ec

feat(api): support specifying vocoders by name

42ad9b0

chore(bin.synthesize): remove unused argument

5daed87

feat(api): support passing a custom speaker encoder by path

1a4e58d

eginhard force-pushed the api branch from 51db916 to ba2a288 Compare December 6, 2024 13:30

eginhard added 4 commits December 6, 2024 15:26

feat(api): allow mixing TTS and vocoder model name and path

85dbb3b

chore(api): add type hints

a05177c

feat(api): support passing speaker/language id file paths

89abd98

refactor(api): use save_wav() from Synthesizer instance

806af96

eginhard force-pushed the api branch from ba2a288 to e1fdd7b Compare December 6, 2024 15:19

eginhard requested a review from Colombine-cyber December 6, 2024 15:20

Colombine-cyber approved these changes Dec 6, 2024

View reviewed changes

refactor(bin.synthesize): use Python API for CLI

e0f6211

eginhard force-pushed the api branch from e1fdd7b to e0f6211 Compare December 6, 2024 16:07

eginhard merged commit b545ab8 into dev Dec 6, 2024
35 checks passed

eginhard deleted the api branch December 6, 2024 17:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expand Python API capabilities #197

Expand Python API capabilities #197

eginhard commented Dec 6, 2024 •

edited

Loading

Expand Python API capabilities #197

Expand Python API capabilities #197

Conversation

eginhard commented Dec 6, 2024 • edited Loading

eginhard commented Dec 6, 2024 •

edited

Loading