TTS

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Chinese Mandarin Male Speech Synthesis Corpus (Sunshine Juvenile)
This dataset was recorded by a 21-year-old male speaker with authentic pronunciation and a sunny, energetic vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Chinese Mandarin Male Speech Synthesis Corpus (Tough Man)
This dataset was recorded by a 30-year-old male speaker with authentic pronunciation and a calm, resolute vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Chinese Multi-dialect Speech Synthesis Corpus (Marketing)
This dataset was recorded by 5 female speakers with authentic pronunciation in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Chinese Multi-speaker – Amateur Multi-Emotion, Multi-Style
This dataset consists of 11 hours of recordings with a balanced gender ratio. It has been meticulously labeled, including pronunciation, prosody, and voice quality labeling. The voice samples, recorded by non-professional speakers, offer a higher degree of naturalness and are categorized and labeled according to voice gender, perceived age, voice description, vocal cord condition, and pronunciation location. The topic includes multi-emotional data, covering emotions such as calm, happy, angry, sad, and more.
Chinese Multi-speaker – Amateur Spontaneous Dialogue
This dataset contains 27 hours of recordings from 6 males and 21 females. The pronunciation and intonation are precisely annotated to ensure high quality and usability. The voices are recorded by non-professional speakers, making the tones more natural, though some accents or hoarseness may be present. The topics of this dataset includes expanded conversational topics such as daily life, hobbies, and special skills.
Chinese Multi-speaker – Spontaneous Dialogue
This dataset contains 18 hours of recordings from 3 male and 2 female speakers, covering various scenarios such as spontaneous dialogue, reading, and mixed Chinese-English reading. All data is precisely labeled, including pronunciation, intonation, and paralinguistic features, ensuring high quality and practical value. The topics of this dataset includes spontaneous dialogue, words, jokes, riddles, proverbs, tongue twisters, poetry, idioms, and interjections.
Chinese Multi-speaker – Unique Voices
This dataset is gender balanced, covering a variety of different voice qualities - mature female, middle-aged male, bass, falsetto, imitation of elderly voice, etc. All have been precisely annotated, including pronunciation, prosody, and paralinguistic features (stress, elongation). The topic includes casual conversations, such as the origin of names, hobbies, childhood experiences, etc.
Chinese Multi-speaker Speech Synthesis Corpus (100 speakers)
This dataset was recorded by one hundred voice actors.
Chinese Multi-speaker Speech Synthesis Corpus (Multiple Styles Virtual Lovers)
This dataset was recorded by 12 speakers with authentic pronunciation and diverse vocal qualities (6 males and 6 females) in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators possess a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More