This dataset was recorded by a 30-year-old male speaker with authentic pronunciation and a calm, gentle vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Saturday Dublin weather foggy, no suitable for divert oneself from boredom
二零一一年入选首批国家技术创新示范企业
秀色摄影,用我们的艺术分享您一生中最美丽的时刻
话说达拉斯这个UTR event越办越成功
People also searched for
Chinese American English Synthesis Corpus
This datasets contains 80 speakers, with a balanced gender ratio, approximately 1.5 hours of data per speaker.
Existing labeling stages: Pronunciation, Prosody
Ongoing labeling: Phoneme boundaries
Overview: Focuses on common/fundamental language, includes everyday dialogue in a natural style