This dataset was recorded in a professional recording studio by 25 speakers (12 male and 13 female) with authentic pronunciation and a variety of vocal qualities. The recorded texts cover the full range of phonemes, and the annotators have professional linguistic backgrounds, so the data meets the research and development needs of voice synthesis.
Creative people create due to a need to release, and heal open wounds
It is what the vast majority of all men would see if here to-night
I should like to see the man that would make me commit suicide, that's all
The discount will automatically be applied at the checkout
People also searched for
Chinese American English Synthesis Corpus
This dataset contains 80 speakers with a balanced gender ratio and approximately 1.5 hours of data per speaker.
Existing labeling stages: Pronunciation, Prosody
Ongoing labeling: Phoneme boundaries
Overview: Focuses on common, fundamental language and includes everyday dialogue in a natural style