I got output like "[sounds of a bus]", "[Side-by-side conversation]", and "[Pause]". Does anyone know why? Thanks #2931

Open
tokyo4 opened this issue Mar 23, 2025 · 1 comment

Comments

@tokyo4
tokyo4 commented Mar 23, 2025

[SIDE CONVERSATION]
[Sounds of a man talking in the background]

Is it because of poor sound quality, such as low volume? Thank you.

With real-time audio, this output also makes me lose the ASR text that was output before it.

@misutoneko

Never seen those particular sound descriptions but I suppose poor quality can do that.
What was the actual command line? Because as of right now, it isn't even clear which model was used.
The smaller models are more likely to miss things, so switching to a bigger model may help.

If you can post a sample that might clear things up as well.
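
Until the cause is clear, one possible stopgap is to filter the bracketed descriptions out of the transcript after the fact. This is only a minimal sketch, assuming the transcript is available as plain text lines and that non-speech annotations always appear in square brackets. `strip_annotations` and `filter_transcript` are hypothetical helper names, not part of any ASR tool.

```python
import re

# Matches one bracketed span such as "[Pause]" or "[SIDE CONVERSATION]".
# Assumption: non-speech annotations are always wrapped in square brackets.
_ANNOTATION = re.compile(r"\[[^\[\]]*\]")


def strip_annotations(line: str) -> str:
    """Remove bracketed annotations and collapse the leftover whitespace."""
    cleaned = _ANNOTATION.sub(" ", line)
    return " ".join(cleaned.split())


def filter_transcript(lines):
    """Yield only the lines that still contain text after stripping."""
    for line in lines:
        cleaned = strip_annotations(line)
        if cleaned:
            yield cleaned


if __name__ == "__main__":
    sample = [
        "[SIDE CONVERSATION]",
        "[Sounds of a man talking in the background]",
        "turn left [Pause] at the next stop",
    ]
    for text in filter_transcript(sample):
        print(text)
```

This only hides the annotations. It would not bring back earlier text that is lost in real-time mode, and it does nothing about the underlying audio or model-size question.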
