UTMOSv2: UTokyo-SaruLab MOS Prediction System

For more details, please refer to our GitHub repository: https://github.com/sarulab-speech/UTMOSv2

🔖 Citation

@inproceedings{baba2024utmosv2,
  title     = {The T05 System for The {V}oice{MOS} {C}hallenge 2024: Transfer Learning from Deep Image Classifier to Naturalness {MOS} Prediction of High-Quality Synthetic Speech},
  author    = {Baba, Kaito and Nakata, Wataru and Saito, Yuki and Saruwatari, Hiroshi},
  booktitle = {IEEE Spoken Language Technology Workshop (SLT)},
  year      = {2024},
}
