Chinese Female Speech Synthesis Corpus (Multi-emotion) - DataoceanAI

Accessibility, Healthcare, Smart Assistants

This dataset was recorded in a professional recording studio by a 22-year-old female speaker with authentic pronunciation and a youthful, lively voice. The recorded texts cover all phonemes, and the annotators have professional linguistic backgrounds, so the data meets the research and development needs of speech synthesis.

Specifications:

ID: King-TTS-105
Size: 3.59 hours
Language: Chinese
Country: China
Sample rate & bit depth: 48 kHz, 24-bit
Recording Environment: Professional recording studio
Gender: Female
Content: Daily language
Labeling Process: Text, audio, prosody labeling, phone labeling, quality inspection, phonetic labeling
Accuracy Rate: The accuracy rate of phonetic labeling is 99.5%.
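For planning storage or checking delivered files, the figures in the specification can be turned into a quick estimate. The sketch below is illustrative only: it assumes uncompressed PCM WAV files and mono audio, neither of which is stated on this page, and the helper names `estimate_pcm_bytes` and `matches_spec` are hypothetical rather than part of any DataoceanAI tooling.

```python
import wave

# Values taken from the specification above.
DURATION_HOURS = 3.59
SAMPLE_RATE_HZ = 48_000
BIT_DEPTH = 24


def estimate_pcm_bytes(hours, sample_rate_hz, bit_depth, channels=1):
    """Approximate raw PCM payload size in bytes (file headers excluded).

    channels=1 (mono) is an assumption; the page does not state it.
    """
    seconds = hours * 3600
    bytes_per_sample = bit_depth // 8
    return round(seconds * sample_rate_hz * bytes_per_sample * channels)


def matches_spec(path, sample_rate_hz=SAMPLE_RATE_HZ, bit_depth=BIT_DEPTH):
    """Return True if a PCM WAV file has the listed sample rate and bit depth."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == sample_rate_hz
            and wav.getsampwidth() * 8 == bit_depth
        )


if __name__ == "__main__":
    size = estimate_pcm_bytes(DURATION_HOURS, SAMPLE_RATE_HZ, BIT_DEPTH)
    print(f"Estimated raw audio size (mono): {size / 1e9:.2f} GB")
```

Under the mono assumption this works out to roughly 1.86 GB of raw audio; stereo delivery would double it. `matches_spec` can be run over each delivered file to confirm it is 48 kHz, 24-bit before training.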

Samples

Audio

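好像有东西突然卡住了他的喉咙，他愤怒地瞪着小家伙。
(It was as if something had suddenly caught in his throat; he glared angrily at the little one.)

曼妮语听了威胁之后，不住地颤抖。
(After hearing the threat, 曼妮语 could not stop trembling.)

突然，小鱼发现有一样东西在一闪一闪的，他睁大眼睛一看。
(Suddenly, the little fish noticed something flickering, and he opened his eyes wide to look.)

糖力不明白爸爸妈妈干吗这么记仇。
(糖力 did not understand why Mom and Dad held such a grudge.)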
