Skip to content

I implemented a ready to use gradio app with more readable code and locally run models #635

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
huynhnhathao opened this issue Apr 10, 2025 · 0 comments

Comments

@huynhnhathao
Copy link

For anyone who is interested, I have implemented a Gradio UI app of BARK that is ready to run on your local computer, plus most of the code in this codebase is reimplemented to be more readable
I also trained a HuBERT model to predict the semantic tokens from audio from a more than 4700 generated ~14s examples of audio-semantic dataset, the validation accuracy of my model is 83% on more than 15k tokens, one can get a feeling of cloning a voice but it is not perfect to impersonate anyone, plus you can do all that in the UI, touching no code

https://github.com/huynhnhathao/bark_text_to_audio

Image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant