-
Notifications
You must be signed in to change notification settings - Fork 312
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Initial commit: Add task audio-text-to-text #1212
base: main
Are you sure you want to change the base?
Conversation
- **Instruction:** | ||
Base models fine-tuned on specialized audio instruction datasets to better handle task-specific querie and conversation. For instance, [Ichigo-llama3.1-s-instruct-v0.4](https://huggingface.co/homebrewltd/Ichigo-llama3.1-s-instruct-v0.4) has been optimized to follow detailed audio-related commands. | ||
|
||
### Use Cases |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd give examples for all of these use-cases!
Models can control computing workflows by parsing spoken instructions, making interactions more natural and accessible. | ||
|
||
|
||
### Useful Resources |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can put ultravox, ichigo, ultravox as resources here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How do I add an audio file for the demo ?
The ASR task page would serve as a good reference for that
Co-authored-by: vb <vaibhavs10@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Feel free to make this Ready to review
when you are done iterating - let's merge this soon!
This is what I did, my question is where should I store the |
Add audio demo file for audio-text-to-text task
I've updated the audio-text-to-text task, but I need guidance on:
How do I add an audio file for the demo ?