This dataset is gender balanced, covering a variety of different voice qualities - mature female, middle-aged male, bass, falsetto, imitation of elderly voice, etc. All have been precisely annotated, including pronunciation, prosody, and paralinguistic features (stress, elongation). The topic includes casual conversations, such as the origin of names, hobbies, childhood experiences, etc.
Premium Chinese Female Voice Speech Synthesis Corpus (3 Speakers)
Features: Agent-style duplex conversation; the main speaker acts as the Agent and engages in multi-turn dialogues with companion voices in scenarios such as emotional companionship and daily Q&A.
Emotion: Both main and companion voices are annotated with 28 emotions—including neutral, joy, anger, sadness, fear, surprise, disgust, etc.—with clause-level emotion tagging.
Standard Arabic Male Voice Speech Synthesis Corpus
Covers news, dialogues, encyclopedic content, time, date, menus, hotels, and English abbreviations, suitable for multi-scenario, multi-domain speech synthesis.