fsicoli/twi
How to use chuksDev/speecht5_tts_npontu_twi with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-audio", model="chuksDev/speecht5_tts_npontu_twi")

# Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("chuksDev/speecht5_tts_npontu_twi")
model = AutoModelForTextToSpectrogram.from_pretrained("chuksDev/speecht5_tts_npontu_twi")

This model is a fine-tuned version of microsoft/speecht5_tts on the FsicoliTwi dataset. It achieves the following results on the evaluation set:
Npontu Twi synthesizes Twi-language speech with a focus on Ghanaian accents and cultural nuances. Fine-tuned from microsoft/speecht5_tts, it aims to provide natural, culturally relevant speech synthesis for applications including education, entertainment, and communication in Ghana and beyond.
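The loading snippet above stops short of producing audio. Below is a minimal sketch of one way to go from text to a WAV file. It makes several assumptions that this card does not confirm: that the checkpoint loads with the standard `SpeechT5Processor` and `SpeechT5ForTextToSpeech` classes, that the `microsoft/speecht5_hifigan` vocoder is a suitable match, and that an all-zeros 512-dimensional speaker embedding is an acceptable placeholder (a real x-vector will likely sound better). The helper names `float_to_pcm16`, `save_wav`, and `synthesize` are hypothetical and exist only for this illustration.

```python
import struct
import wave


def float_to_pcm16(samples):
    """Convert floats in [-1.0, 1.0] to little-endian 16-bit PCM bytes."""
    clipped = [max(-1.0, min(1.0, float(s))) for s in samples]
    return struct.pack("<%dh" % len(clipped), *(int(s * 32767) for s in clipped))


def save_wav(samples, path, sample_rate=16000):
    """Write mono 16-bit PCM audio. SpeechT5 produces 16 kHz waveforms."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)
        wav_file.setsampwidth(2)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(float_to_pcm16(samples))


def synthesize(text, model_id="chuksDev/speecht5_tts_npontu_twi"):
    """Return a list of float samples for `text` (downloads weights on first use)."""
    # Heavy dependencies are imported lazily so the WAV helpers work without them.
    import torch
    from transformers import (
        SpeechT5ForTextToSpeech,
        SpeechT5HifiGan,
        SpeechT5Processor,
    )

    processor = SpeechT5Processor.from_pretrained(model_id)
    model = SpeechT5ForTextToSpeech.from_pretrained(model_id)
    # Assumption: the generic SpeechT5 vocoder is used to turn spectrograms into audio.
    vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")

    inputs = processor(text=text, return_tensors="pt")
    # Assumption: placeholder speaker embedding; replace with a real x-vector.
    speaker_embeddings = torch.zeros((1, 512))
    with torch.no_grad():
        speech = model.generate_speech(
            inputs["input_ids"], speaker_embeddings, vocoder=vocoder
        )
    return speech.tolist()


# Example call (requires torch, transformers, and network access):
# save_wav(synthesize("Akwaaba"), "twi_sample.wav")
```

The WAV writer relies only on the standard library, so the output step does not add a dependency beyond `torch` and `transformers`. The text "Akwaaba" is only an illustrative input.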
More information needed
The following results were recorded during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.4207 | 14.4928 | 1000 | 0.3869 |
| 0.41 | 28.9855 | 2000 | 0.3824 |
Base model
microsoft/speecht5_tts