The voices of Chinese origin, when used to generate English samples, do sound like a Chinese person speaking English. It is very interesting.
vessenes 12 hours ago [-]
This is really quite good at sounding like Donald, especially for the first half of the audio. I’ll probably play around with this for a bit; it’s not clear to me how much variation you can get in voice in latent space. Anyway, it looks to be a very high-quality (at least short-form) TTS engine with open weights, so thanks team!
fdafds 14 hours ago [-]
[flagged]
https://spark-tts.github.io/