TTS

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Norwegian Natural Dialogue-Style Read Speech Synthesis Corpus (Read Speech)
Polish Female Speech Synthesis Corpus
This dataset was recorded by a 37-year-old voice actor with a mature and elegant timbre.
Polish Male Speech Synthesis Corpus
This dataset was recorded by a 32-year-old voice actor with a mature and steady timbre.
Portuguese Female Speech Synthesis Corpus
This dataset was recorded by a 28-year-old voice actor with a neutral-styled timbre.
Portuguese Male Speech Synthesis Corpus
This dataset was recorded by a 28-year-old voice actor with a calm and soft timbre.
Premium Chinese Female Voice Speech Synthesis Corpus (3 Speakers)
Features: Agent-style duplex conversation; the main speaker acts as the Agent and engages in multi-turn dialogues with companion voices in scenarios such as emotional companionship and daily Q&A. Emotion: Both main and companion voices are annotated with 28 emotions—including neutral, joy, anger, sadness, fear, surprise, disgust, etc.—with clause-level emotion tagging.
Romanian Female Speech Synthesis Corpus
This dataset was recorded by a single 33-year-old female speaker with authentic pronunciation and a mature, gentle vocal quality in a professional recording studio. The recorded texts encompass the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Romanian Male Speech Synthesis Corpus
This dataset was recorded by a single 30-year-old male speaker with authentic pronunciation and a calm, gentle vocal quality in a professional recording studio. The recorded texts encompass the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Russian Female Speech Synthesis Corpus
This dataset was recorded by a 30-year-old female speaker with authentic pronunciation and a gentle, friendly vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More