Name: Standard Arabic Male Speech Synthesis Corpus (Natural Style) - DataoceanAI
SKU: King-TTS-173-1
Availability: InStock

TTS

Standard Arabic Male Speech Synthesis Corpus (Natural Style)

TTS Standard Arabic Male

The database recorded 10,358 sentences (124,825 words) from a male voice talent. The total audio duration is about 18.37 hours, including the original silence at the beginning and ending (about 300 ms each). The recorded content is organized into 5 texts, including multiple fields, such as news, encyclopedia, dialog, etc., also including Arabic English code switching data. We used ar-msa_sampa & en-us_CMU phone set for labeling. The voice talent was born and raised in United Arab Emirates, with standard Arabic and good English. He studied in broadcasting and performance, with a good line foundation. The recording has a friendly and natural timbre and even speech rate.

Specifications:

ID:

King-TTS-173-1

Size:

18.37 hrs

Language:

Arabic

Sample rate & bit depth

48 kHz,24bit

Recording environment

Studio

Speaker

1 male

Devices:

Studio

Accuracy Rate

proofreading -- based on individual word, the accuracy is 99% phonetic labeling -- based on individual phone, the accuracy is 99.5% prosody labeling -- based on individual symbol, the accuracy is 98%

Samples

Audio

King-TTS-173-1-101946

King-TTS-173-1-112310

King-TTS-173-1-121103

King-TTS-173-1-150052_4

People also searched for

American English Male and Female Speech Synthesis Corpus (Customer and Audiobook)

This database contains 2000 sentences from one female speaker and one male speaker, with a total audio duration of approximately 2 hours. The texts include customer and audiobook field.

American English Male Smart Search

Brazilian Portuguese Male and Female Speech Synthesis Corpus

The database recorded 2,924 sentences (49,025 words) from 3 voice talents(2 females and 1 male). The total audio duration is about 6 hours, including the original silence at the beginning and ending (about 300 ms each). The recorded content is organized into 11 texts, F048-03 including multiple fields, such as news, letters, digit, etc. We used pt-BR_xsampa phone set for labeling. The voice talents were born and raised in Brazil, in 1969/1973/1997, with standard Brazilian Portuguese and were 56/51/48 years old when recording the database, with a good line foundation. The recordings have even speech rate.

Brazilian Portuguese

Spain Spanish Male and Female Speech Synthesis Corpus

The database recorded 3,068 sentences (52,494 words) from 3 voice talents(2 females and 1 male). The total audio duration is about 6 hours, including the original silence at the beginning and ending (about 300 ms each). The recorded content is organized into 11 texts, F021-04 including multiple fields , such as news, letter, digit, etc. We used es-es_sampa phone set for labeling. The voice talents were born and raised in Spain in1962/1984/1987, with standard Spanish, and were 63/41/37 years old when recording the database, with a good line foundation. The recording have even speech rate.

Spain Spanish

New Zealand English Female Speech Synthesis Corpus

The database recorded 1,600 sentences (17,808 words) from a male voice talent. The total audio duration is about 2.04 hours, including the original silence at the beginning and ending (about 350 ms each). The recorded content is organized into 1 texts, news. The voice talent was born and raised in New Zealand in 1989, with standard New Zealand English. She is a professional voice talent who has many years of experience in dubbing and broadcasting , with a good line foundation.

New Zealand English