Skip to content

Files

Latest commit

 

History

History
79 lines (51 loc) · 5.21 KB

README.md

File metadata and controls

79 lines (51 loc) · 5.21 KB

ESPnet Notebooks

Demo

ASR (Speech recognition)

SE (Speech enhancement/separation)

SLU (Spoken language understanding)

TTS (Text-to-speech)

Other utilities

ESPnet-EZ

ASR (Speech recognition)

ST (Speech-to-text translation)

  • integrate_huggingface.ipynb: Integrating the weakly-supervised model (OWSM) and huggingface's pre-trained language model with ESPnet-EZ on MuST-C-v2.
  • ST_finetune_owsm.ipynb: Fine-tuning the weakly-supervised model (OWSM) with ESPnet-EZ on MuST-C-v2.

SLU (Spoken language understanding)

TTS (Text-to-speech)

SVS (Singing voice synthesis)

Course

CMU SpeechProcessing Spring2023

CMU SpeechRecognition Fall2022

CMU SpeechRecognition Fall2021

ESPnet1 (Legacy)