This dataset was recorded by a single 33-year-old female speaker with authentic pronunciation and an elegant, mature vocal quality in a professional recording studio. The recorded texts encompass the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
kilenc négy kilenc négy három nulla három egy kettő nyolc
Lebonthatják a Déli pályaudvart
Szerinte az eurózóna legnagyobb gazdaságának mélyrepülése átnyúlik a jövő év első két negyedévére is
Úgy is mindenki Ságvárinak fogja hívni, mondja a nyugalmazott gimnázium igazgató
People also searched for
Chinese American English Synthesis Corpus
This datasets contains 80 speakers, with a balanced gender ratio, approximately 1.5 hours of data per speaker.
Existing labeling stages: Pronunciation, Prosody
Ongoing labeling: Phoneme boundaries
Overview: Focuses on common/fundamental language, includes everyday dialogue in a natural style