TTS

American English Speech Synthesis Corpus (Reocrded by Phone of 620 Speakers)

This dataset was recorded by 620 speakers with authentic pronunciation and diverse vocal qualities (334 males and 286 females) in a quiet indoor environment. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Accessibility Content Creation and Entertainment Language Learning

American English Synthesis Corpus

50 speakers, gender balanced, with pronunciation and prosody annotations

Audiobooks Content Creation and Entertainment Language Learning

American Pop Songs Speech Synthesis Corpus (150 songs)

This dataset was recorded by 5 speakers with authentic pronunciation and diverse vocal qualities (one male and four females) in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Content Creation and Entertainment Music Applications Music Production

Australian English Female Speech Synthesis Corpus

This dataset was recorded by a 59-year-old female speaker with authentic pronunciation and a mature, elegant vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Accessibility Content Creation and Entertainment Smart Assistants

Australian English Male and Female Multi-Emotion Speech Synthesis Corpus (Free Talk)

[Emotion] Five emotions: Excited, Sad, Angry, Fearful, and Empathetic. [Duration] Approximately 2 hours per speaker.

speech synthesis English

Australian English Male Speech Synthesis Corpus

This dataset was recorded by a 42-year-old male speaker with authentic pronunciation and a mature, refined vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Accessibility Content Creation and Entertainment Smart Assistants

Australian English Natural Conversational Speech Synthesis Corpus (Read Aloud)

[Style] Voice Assistant style, Podcast style, Audiobook style, and Online Learning style (four styles in total). [Duration] Approximately 2 hours per speaker, with about 0.5 hours for each style. [Content] Voice assistant data is recorded as single utterances, while the other styles are recorded in paragraph-level segments.

speech synthesis English

Austrian German Natural Conversational Speech Synthesis Corpus (Read Speech)

speech synthesis German

Austrian German Natural Dialogue Speech Synthesis Corpus (Conversational Speech)

speech synthesis German

Filter by

American English Speech Synthesis Corpus (Reocrded by Phone of 620 Speakers)

American English Synthesis Corpus

American Pop Songs Speech Synthesis Corpus (150 songs)

Australian English Female Speech Synthesis Corpus

Australian English Male and Female Multi-Emotion Speech Synthesis Corpus (Free Talk)

Australian English Male Speech Synthesis Corpus

Australian English Natural Conversational Speech Synthesis Corpus (Read Aloud)

Austrian German Natural Conversational Speech Synthesis Corpus (Read Speech)

Austrian German Natural Dialogue Speech Synthesis Corpus (Conversational Speech)

Get started

Filter by

Filter by

TTS

Filter by

American English Speech Synthesis Corpus (Reocrded by Phone of 620 Speakers)

American English Synthesis Corpus

American Pop Songs Speech Synthesis Corpus (150 songs)

Australian English Female Speech Synthesis Corpus

Australian English Male and Female Multi-Emotion Speech Synthesis Corpus (Free Talk)

Australian English Male Speech Synthesis Corpus

Australian English Natural Conversational Speech Synthesis Corpus (Read Aloud)

Austrian German Natural Conversational Speech Synthesis Corpus (Read Speech)

Austrian German Natural Dialogue Speech Synthesis Corpus (Conversational Speech)

Get started

Join our newsletter to stay updated

Filter by

Filter by