TTS and STS Models to port to MLX-Audio (Roadmap) #1

Blaizzy · 2025-02-28T14:50:15Z

dwohlfahrt · 2025-02-28T18:04:39Z

As always, thanks a MILLION for all the work you do @Blaizzy. You are a legend in the truest sense of the word 🙏

And now, of course, I have to chime in with my own selfish requests 😄

For TTS, adding on a +1 for Zonos
For STS, RVC would be huge, as I run kokoro outputs through it using RVC-generated fine tunes of custom voices to basically get the best of both worlds (aka kokoro with true voice cloning). Results are absolute 🔥 😃

Blaizzy · 2025-02-28T19:32:57Z

Thanks a lot, it's my pleasure!

Yes, Zonos is on the way 🚀

Could you share this RVC + Kokoro example?

szafranek · 2025-03-01T19:16:57Z

I found this project through your awesome demo.

Would you consider supporting StyleTTS2?

chigkim · 2025-03-19T00:27:28Z

This is amazing!!!
@Blaizzy when do you sleep? VLM, now Audio?
Anyways, there's Outes. It's based on text LLMs, so even llama.cpp and ExLlamaV2 can run it.
Thanks!

chigkim · 2025-03-19T22:52:16Z

Orpheus-TTS was just released, and it sounds really good!
https://github.com/canopyai/Orpheus-TTS

lin72h · 2025-03-19T22:55:21Z

@chigkim the KING has already started working on it: #47

Blaizzy changed the title ~~TTS and STS Models to port to MLX-Audio~~ TTS and STS Models to port to MLX-Audio (Roadmap) Feb 28, 2025

Blaizzy added the "good first issue" (Good for newcomers) label Feb 28, 2025

Blaizzy commented Feb 28, 2025

Overview

Text-to-Speech (TTS) Models

Planned TTS Models

Speech-to-Speech (STS) Models

Planned STS Models

Technical Considerations

Community Input
