American English Speech Synthesis Corpus (Reocrded by Phone of 620 Speakers) - DataoceanAI

American English Speech Synthesis Corpus (Reocrded by Phone of 620 Speakers)

Accessibility Content Creation and Entertainment Language Learning

This dataset was recorded by 620 speakers with authentic pronunciation and diverse vocal qualities (334 males and 286 females) in a quiet indoor environment. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Specifications:

ID:

King-TTS-170

Size:

9.99 hours

Language:

English

Country

United States

Sample rate & bit depth

16 kHz, 16bit

Recording Environment

Quiet room

Gender

Male/Female

Content

Daily language.

Labeling Process

Text, audio, phone labeling, quality inspection, phonetic labeling

Accuracy Rate

The accuracy rate of phonetic labeling is 99.5%.

Samples

Audio

People also searched for

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

Female American English Mature Voice

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Female American English Mature Voice

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Rulai Fozu

Chinese Male Voice Character

Get started