Name: Indian English Male and Female Multi-emotion Speech Synthesis Corpus (Free Talk) - DataoceanAI
SKU: King-TTS-358
Availability: InStock

TTS

Indian English Male and Female Multi-emotion Speech Synthesis Corpus (Free Talk)

speech synthesis English

The database recorded 4,086 sentences (92,399 words) from three male voice talents and two female talents. The total audio duration is about 10.77 hours, including the original silence at the beginning and ending (about 300ms each). The recorded content is organized into 5 texts, including 5 emotions. They are excited, sad, angry, fear and empathy. The female speakers were born and raised in India in 2000, 1998, and 1993 respectively, while the male speakers were born and raised in India in 2000 and 1981 respectively. They all speak standard Indian English, and were between the ages of 25 and 44 when recording the database.

Specifications:

ID:

King-TTS-358

Size:

10.77 hours

Language:

English (Indian)

Sample rate & bit depth

48 kHz,24bit

Recording environment

Studio

Speaker

2 males and 2 females

Devices:

Studio

Accuracy Rate

proofreading -- based on individual word, the accuracy is 99%

Samples

Audio

King-TTS-358-F1_0100024_00005

King-TTS-358-F1_0300009_00005

King-TTS-358-F3_0100001_00029

King-TTS-358-F3_0500007_00014

People also searched for

American English Male and Female Speech Synthesis Corpus (Customer and Audiobook)

This database contains 2000 sentences from one female speaker and one male speaker, with a total audio duration of approximately 2 hours. The texts include customer and audiobook field.

American English Male Smart Search

Brazilian Portuguese Male and Female Speech Synthesis Corpus

The database recorded 2,924 sentences (49,025 words) from 3 voice talents(2 females and 1 male). The total audio duration is about 6 hours, including the original silence at the beginning and ending (about 300 ms each). The recorded content is organized into 11 texts, F048-03 including multiple fields, such as news, letters, digit, etc. We used pt-BR_xsampa phone set for labeling. The voice talents were born and raised in Brazil, in 1969/1973/1997, with standard Brazilian Portuguese and were 56/51/48 years old when recording the database, with a good line foundation. The recordings have even speech rate.

Brazilian Portuguese

Spain Spanish Male and Female Speech Synthesis Corpus

The database recorded 3,068 sentences (52,494 words) from 3 voice talents(2 females and 1 male). The total audio duration is about 6 hours, including the original silence at the beginning and ending (about 300 ms each). The recorded content is organized into 11 texts, F021-04 including multiple fields , such as news, letter, digit, etc. We used es-es_sampa phone set for labeling. The voice talents were born and raised in Spain in1962/1984/1987, with standard Spanish, and were 63/41/37 years old when recording the database, with a good line foundation. The recording have even speech rate.

Spain Spanish

New Zealand English Female Speech Synthesis Corpus

The database recorded 1,600 sentences (17,808 words) from a male voice talent. The total audio duration is about 2.04 hours, including the original silence at the beginning and ending (about 350 ms each). The recorded content is organized into 1 texts, news. The voice talent was born and raised in New Zealand in 1989, with standard New Zealand English. She is a professional voice talent who has many years of experience in dubbing and broadcasting , with a good line foundation.

New Zealand English