This dataset contains 27 hours of recordings from 6 males and 21 females. The pronunciation and intonation are precisely annotated to ensure high quality and usability. The voices are recorded by non-professional speakers, making the tones more natural, though some accents or hoarseness may be present. The topics of this dataset includes expanded conversational topics such as daily life, hobbies, and special skills.
Premium Chinese Female Voice Speech Synthesis Corpus (3 Speakers)
Features: Agent-style duplex conversation; the main speaker acts as the Agent and engages in multi-turn dialogues with companion voices in scenarios such as emotional companionship and daily Q&A.
Emotion: Both main and companion voices are annotated with 28 emotions—including neutral, joy, anger, sadness, fear, surprise, disgust, etc.—with clause-level emotion tagging.
Standard Arabic Male Voice Speech Synthesis Corpus
Covers news, dialogues, encyclopedic content, time, date, menus, hotels, and English abbreviations, suitable for multi-scenario, multi-domain speech synthesis.