TTS

Search our off-the-shelf datasets.

Live-Streaming Sales (Female Voice)
Labeled: Pronunciation & Rhythm
Live-Streaming Sales (Male Voice)
Labeled: Pronunciation & Rhythm
Luganda Female Speech Synthesis Corpus
This dataset was recorded by a 22-year-old female speaker with authentic pronunciation and a warm, soft vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Luganda Male Speech Synthesis Corpus
This dataset was recorded by a 23-year-old male speaker with authentic pronunciation and a mature, steady vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Malaysian Female Speech Synthesis Corpus (Customer Service Style)
This dataset was recorded by a 32-year-old female speaker with authentic pronunciation and a mature, elegant vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the needs for research and development in voice synthesis.
Maltese Female Speech Synthesis Corpus
This dataset was recorded by a 26-year-old female speaker with authentic pronunciation and a warm, soft vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Maltese Male Speech Synthesis Corpus
This dataset was recorded by a 34-year-old male speaker with authentic pronunciation and a mature, steady vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.
Mandarin Chinese with Multi-Emotion & Multi-Timbre Synthesis Corpus
The dataset includes 142 distinct speakers expressing a range of emotions, including happiness, sadness, anger, surprise, calmness, dislike, and fear. It is intended to improve the naturalness and expressiveness of speech synthesis models.
Mandarin Chinese with Natural Style Voice Synthesis Corpus
The dataset includes 237 speakers covering a variety of voice qualities, such as mature female, middle-aged male, bass, and falsetto, and spans young, middle-aged, and elderly speakers. The audio is clear and natural, and the data is intended to improve the naturalness and expressiveness of speech synthesis models.

