This dataset was recorded by a 30-year-old male speaker with authentic pronunciation and a mature, composed vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the needs for research and development in voice synthesis.
He would grow large in one second, and shrink down in the next
不用,这天也没刮风,没事儿,赶快吃完回去洗澡吧
关于牧区养牲口的amount由此可略知一二
如果死在这辆火车上能让你闭嘴我求之不得
People also searched for
Chinese American English Synthesis Corpus
This datasets contains 80 speakers, with a balanced gender ratio, approximately 1.5 hours of data per speaker.
Existing labeling stages: Pronunciation, Prosody
Ongoing labeling: Phoneme boundaries
Overview: Focuses on common/fundamental language, includes everyday dialogue in a natural style