UTMOSv2: UTokyo-SaruLab MOS Prediction System
|
|
|
|
|
For more details, please refer to our GitHub repository: https://github.com/sarulab-speech/UTMOSv2
π Citation
@inproceedings{baba2024utmosv2,
title = {The T05 System for The {V}oice{MOS} {C}hallenge 2024: Transfer Learning from Deep Image Classifier to Naturalness {MOS} Prediction of High-Quality Synthetic Speech},
author = {Baba, Kaito and Nakata, Wataru and Saito, Yuki and Saruwatari, Hiroshi},
booktitle = {IEEE Spoken Language Technology Workshop (SLT)},
year = {2024},
}
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support