Chinese and English Multi-speaker Speech Synthesic Corpus（Multi-domain & emotion） - DataoceanAI

Chinese and English Multi-speaker Speech Synthesic Corpus（Multi-domain & emotion）

Audiobooks Content Creation and Entertainment Smart Assistants

This dataset was recorded by 60 speakers with authentic pronunciation and diverse vocal qualities (30 males and 30 females) in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Specifications:

ID:

King-TTS-100

Size:

37.02 hours

Language:

Chinese-English Code-mixing

Country

China

Sample rate & bit depth

48 kHz, 16bit

Recording Environment

Professional recording studio

Gender

Male/Female

Content

Daily language, news, poetry, and other fields.

Labeling Process

Text, audio, prosody labeling, POS labeling, quality inspection, phonetic labeling

Accuracy Rate

The accuracy rate of phonetic labeling is 99.5%.

Samples

Audio

That will be decided months and months from now

仔细看看“America”这个单词，“America”是英语中“美国”的意思

据研究，卦爻辞反映了殷周之际及西周初期的社会生活

知识分子要建立一个群体，而且要具备批判的自我意识

People also searched for

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

Female American English Mature Voice

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Female American English Mature Voice

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Rulai Fozu

Chinese Male Voice Character

Get started