TTS

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Chinese Speak American English Synthesis Corpus
This datasets contains 80 speakers, with a balanced gender ratio, approximately 1.5 hours of data per speaker. Existing labeling stages: Pronunciation, Prosody Ongoing labeling: Phoneme boundaries Overview: Focuses on common/fundamental language, includes everyday dialogue in a natural style
Chinese Speech Synthesis Corpus (Role play)
This dataset was recorded by three speakers with authentic pronunciation and diverse vocal qualities (1 male and 2 females) in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Chinese Sweet Female Song Synthesis Corpus (200 Pop Songs)
This dataset was recorded by a 25-year-old female speaker with authentic pronunciation and a lively, sweet vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators possess a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Chinese-English Code-mixing Male Speech Synthesis Corpus
This dataset was recorded by a 30-year-old male speaker with authentic pronunciation and a calm, gentle vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Chinese-English Code-mixing Male Speech Synthesis Corpus (Food Program Commentary)
This dataset was recorded by a 38-year-old male speaker with authentic pronunciation and a mature, refined vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Czech Female Speech Synthesis Corpus
This dataset was recorded by a 37-year-old voice actor with a mature and grand timbre.
Czech Male Speech Synthesis Corpus
This dataset was recorded by a 32-year-old voice actor with a mature and grand timbre.
Danish Female Speech Synthesis Corpus
This dataset was recorded by a 27-year-old voice actor with a mature and soft timbre.
Danish Male Speech Synthesis Corpus
This dataset was recorded by a 26-year-old voice actor with a steady and mature timbre.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More