This dataset was recorded by a 25-year-old female speaker with authentic pronunciation and a gentle, sweet vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Text, audio, phone labeling, quality inspection, xml labeling, run labeling, phonetic labeling
Accuracy Rate
The accuracy rate of phonetic labeling is 99.5%.
Samples
Audio
如果再见不能红着眼,是否还能红着脸
旧时月色织成多少诗篇,青苔造访神道的石阶
梦是远飞翔,你就是我左半边翅膀
People also searched for
Chinese American English Synthesis Corpus
This datasets contains 80 speakers, with a balanced gender ratio, approximately 1.5 hours of data per speaker.
Existing labeling stages: Pronunciation, Prosody
Ongoing labeling: Phoneme boundaries
Overview: Focuses on common/fundamental language, includes everyday dialogue in a natural style