The database recorded 3091 sentences(49564 words)from three male voice talents. The total audio duration is about 7.48 hours, including the silence at the beginning and ending (about 300 ms each).
The recorded content is freetalk, including multiple emotions, such as excited, sad, angry, etc.
The voice talent M1 was born in Morocco in 2004, with standard Saudi Arabic and good English, and was 21 years old when recording the database. The recording has even speech rate.
The voice talent M2 was born and raised in Saudi Arabia in 1994, with standard Saudi Arabic and good English, and was 31 years old when recording the database. The recording has even speech rate.
The voice talent M3 was born and raised in Saudi Arabia in 1992, with standard Saudi Arabic and good English, and was 33 years old when recording the database. The recording has even speech rate.
proofreading -- based on individual word, the accuracy is 99%
Samples
Audio
King-TTS-344-M1_01000014_00003
King-TTS-344-M1_05000018_00002
King-TTS-344-M3_02000012_00005
King-TTS-344-M3_05000023_00002
People also searched for
American English Male and Female Speech Synthesis Corpus (Customer and Audiobook)
This database contains 2000 sentences from one female speaker and one male speaker, with a total audio duration of approximately 2 hours. The texts include
customer and audiobook field.
Brazilian Portuguese Male and Female Speech Synthesis Corpus
The database recorded 2,924 sentences (49,025 words) from 3 voice talents(2 females and 1 male). The total audio duration is about 6 hours, including the original silence at the beginning and ending (about 300 ms each).
The recorded content is organized into 11 texts, F048-03 including multiple fields, such as news, letters, digit, etc. We used pt-BR_xsampa phone set for labeling.
The voice talents were born and raised in Brazil, in 1969/1973/1997, with standard Brazilian Portuguese and were 56/51/48 years old when recording the database, with a good line foundation. The recordings have even speech rate.
Spain Spanish Male and Female Speech Synthesis Corpus
The database recorded 3,068 sentences (52,494 words) from 3 voice talents(2 females and 1 male). The total audio duration is about 6 hours, including the original silence at the beginning and ending (about 300 ms each).
The recorded content is organized into 11 texts, F021-04 including multiple fields , such as news, letter, digit, etc. We used es-es_sampa phone set for labeling.
The voice talents were born and raised in Spain in1962/1984/1987, with standard Spanish, and were 63/41/37 years old when recording the database, with a good line foundation. The recording have even speech rate.
New Zealand English Female Speech Synthesis Corpus
The database recorded 1,600 sentences (17,808 words) from a male voice talent. The total audio duration is about 2.04 hours, including the original silence at the beginning and ending (about 350 ms each).
The recorded content is organized into 1 texts, news.
The voice talent was born and raised in New Zealand in 1989, with standard New Zealand English. She is a professional voice talent who has many years of experience in dubbing and broadcasting , with a good line foundation.