UTMOSv2: UTokyo-SaruLab MOS Prediction System

GitHub Hugging Face Spaces
arXiv poster Open In Colab

For more details, please refer to our GitHub repository: https://github.com/sarulab-speech/UTMOSv2

πŸ”– Citation

@inproceedings{baba2024utmosv2,
  title     = {The T05 System for The {V}oice{MOS} {C}hallenge 2024: Transfer Learning from Deep Image Classifier to Naturalness {MOS} Prediction of High-Quality Synthetic Speech},
  author    = {Baba, Kaito and Nakata, Wataru and Saito, Yuki and Saruwatari, Hiroshi},
  booktitle = {IEEE Spoken Language Technology Workshop (SLT)},
  year      = {2024},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Space using sarulab-speech/UTMOSv2 1