The important question

by yukiarimo - opened 5 days ago

Discussion

yukiarimo

5 days ago

So, how about releasing the full dataset? Or you have just illegally ripped off stolen voices from the web?

xscMpOV

5 days ago

surely this is the best way to ask for anything

mueller91

4 days ago

most companies/developers wouldn't release a training dataset, even when the model is open source. this is not unusual.

yukiarimo

4 days ago

•

edited 3 days ago

most companies/developers wouldn't release a training dataset, even when the model is open source. this is not unusual.

A bunch of projects like VITS, Tacotron, etc., have released! (And usually they use LJSpeech)
If you not even say where the data is coming from, it's definitely 100% stolen and they MUST be banned from HF!

flowring-luyiourwong

about 14 hours ago

A bunch of projects like VITS, Tacotron, etc., have released! (And usually they use LJSpeech)

If you not even say where the data is coming from, it's definitely 100% stolen and they MUST be banned from HF!

right, hf should ban 95% models include gpt, llama, gemma as well. none of them have release datasets lol

btw, maya actully notes training data in the metadata

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment