Initial commit: Add task audio-text-to-text #1212

Deep-unlearning · 2025-02-20T13:37:08Z

Add audio demo file for audio-text-to-text task

I've updated the audio-text-to-text task, but I need guidance on:

How do I add an audio file for the demo ?

packages/tasks/src/tasks/audio-text-to-text/about.md

Vaibhavs10 · 2025-02-24T15:14:29Z

packages/tasks/src/tasks/audio-text-to-text/about.md

+- **Instruction:**  
+  Base models fine-tuned on specialized audio instruction datasets to better handle task-specific querie and conversation. For instance, [Ichigo-llama3.1-s-instruct-v0.4](https://huggingface.co/homebrewltd/Ichigo-llama3.1-s-instruct-v0.4) has been optimized to follow detailed audio-related commands.
+
+### Use Cases


I'd give examples for all of these use-cases!

Vaibhavs10 · 2025-02-24T15:15:10Z

packages/tasks/src/tasks/audio-text-to-text/about.md

+  Models can control computing workflows by parsing spoken instructions, making interactions more natural and accessible.
+
+
+### Useful Resources


You can put ultravox, ichigo, ultravox as resources here

Vaibhavs10

How do I add an audio file for the demo ?

The ASR task page would serve as a good reference for that

Co-authored-by: vb <vaibhavs10@gmail.com>

Vaibhavs10

Feel free to make this Ready to review when you are done iterating - let's merge this soon!

Deep-unlearning · 2025-02-28T14:17:57Z

How do I add an audio file for the demo ?

The ASR task page would serve as a good reference for that

This is what I did, my question is where should I store the sample-audio.wav ? For images, I saw that they were here: https://huggingface.co/datasets/huggingfacejs/tasks
But what about the audio files ?

Deep-Unlearning and others added 2 commits February 20, 2025 14:32

Initial commit: Add task audio-text-to-text

0d56ce2

Merge branch 'main' into audio-text-to-text

65ffff2

Vaibhavs10 reviewed Feb 24, 2025

View reviewed changes

Update packages/tasks/src/tasks/audio-text-to-text/about.md

717c0d0

Co-authored-by: vb <vaibhavs10@gmail.com>

Vaibhavs10 reviewed Feb 26, 2025

View reviewed changes

Deep-unlearning and others added 2 commits February 28, 2025 12:38

Merge branch 'main' into audio-text-to-text

365870d

added useful resources

8d6bd6f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial commit: Add task audio-text-to-text #1212

Initial commit: Add task audio-text-to-text #1212

Deep-unlearning commented Feb 20, 2025

Vaibhavs10 Feb 24, 2025

Vaibhavs10 Feb 24, 2025

Vaibhavs10 left a comment

Vaibhavs10 left a comment

Deep-unlearning commented Feb 28, 2025

		Models can control computing workflows by parsing spoken instructions, making interactions more natural and accessible.


		### Useful Resources

Initial commit: Add task audio-text-to-text #1212

Are you sure you want to change the base?

Initial commit: Add task audio-text-to-text #1212

Conversation

Deep-unlearning commented Feb 20, 2025

Add audio demo file for audio-text-to-text task

Vaibhavs10 Feb 24, 2025

Choose a reason for hiding this comment

Vaibhavs10 Feb 24, 2025

Choose a reason for hiding this comment

Vaibhavs10 left a comment

Choose a reason for hiding this comment

Vaibhavs10 left a comment

Choose a reason for hiding this comment

Deep-unlearning commented Feb 28, 2025