American English Female Speech Synthesis Corpus (Natural Conversational Style) - DataoceanAI

American English Female Speech Synthesis Corpus (Natural Conversational Style)

Audiobooks Healthcare Smart Assistants

This dataset was recorded by a 30-year-old female speaker with authentic pronunciation and a friendly, soft vocal quality in a professional recording studio. The recorded texts span the full range of phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Specifications:

ID:

King-TTS-160

Size:

2.74 hours

Language:

English

Country

United States

Sample rate & bit depth

48 kHz, 16bit

Recording Environment

Professional recording studio

Gender

Female

Content

Daily conversation.

Labeling Process

Text, audio, prosody labeling, quality inspection, phonetic labeling

Accuracy Rate

The accuracy rate of phonetic labeling is 99.5%.

Samples

Audio

People also searched for

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

Female American English Mature Voice

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Female American English Mature Voice

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Rulai Fozu

Chinese Male Voice Character

Get started