American English Male Speech Synthesis Corpus - DataoceanAI

American English Male Speech Synthesis Corpus

Accessibility Audiobooks Smart Assistants

This dataset was recorded by a 58-year-old male speaker with authentic pronunciation and a mature, steady vocal quality in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Specifications:

ID:

King-TTS-230

Size:

19.39 hours

Language:

English

Country

United States

Sample rate & bit depth

48 kHz, 16bit

Recording Environment

Professional recording studio

Gender

Male

Content

News, conversation, encyclopedia, transportation, reservation, booking, hotel, etc.

Labeling Process

Text, audio, prosody labeling, phone labeling, pos labeling, quality inspection, phonetic labeling

Accuracy Rate

The accuracy rate of phonetic labeling is 99.5%.

Samples

Audio

People also searched for

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

Female American English Mature Voice

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Female American English Mature Voice

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Rulai Fozu

Chinese Male Voice Character

Get started