Name: Mandarin Male Speech Synthesis Corpus (Audiobook) - DataoceanAI
SKU: King-TTS-162
Availability: InStock

TTS

Mandarin Male Speech Synthesis Corpus (Audiobook)

Audiobooks, Content Creation and Entertainment, Language Learning

The database contains 8,658 sentences (137,253 words) recorded by one male voice talent. Of these, 4,758 sentences were recorded in paragraph format as 36 audio pieces. The total audio duration is about 13.26 hours, including the original silence at the beginning and end (about 1.5 s each). The recorded content is sourced from novels and organized into two scripts, which contain novel paragraph excerpts and additional sentences. We used the zh-cn_pinyin phone set for labeling. The voice talent is a professional broadcaster who was born in China and speaks standard Mandarin with a steady, consistent voice.
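Because the stated duration includes roughly 1.5 s of silence at each end, a preprocessing step may want to drop that padding. The following is a minimal Python sketch, not part of the corpus tooling. It assumes the audio is delivered as uncompressed WAV files (the listing does not state the container format), and the function name trim_padding and its pad_seconds parameter are hypothetical. Since the silence length is only approximate, a fixed cut like this can clip speech or leave residual silence; an energy-based detector would likely be more robust.

```python
import wave


def trim_padding(src, dst, pad_seconds=1.5):
    """Copy a WAV file, dropping pad_seconds of audio from both ends.

    src and dst may be file paths or binary file objects.
    pad_seconds=1.5 mirrors the "about 1.500 s" figure in the description;
    it is an approximation, not a measured value.
    Returns the duration of the trimmed audio in seconds.
    """
    with wave.open(src, "rb") as reader:
        params = reader.getparams()
        rate = reader.getframerate()
        total = reader.getnframes()
        pad = int(round(pad_seconds * rate))  # padding length in frames
        keep = max(total - 2 * pad, 0)        # frames left after trimming
        reader.setpos(min(pad, total))        # skip the leading padding
        frames = reader.readframes(keep)

    with wave.open(dst, "wb") as writer:
        writer.setparams(params)              # same rate, width, channels
        writer.writeframes(frames)            # frame count in header is patched

    return keep / rate
```

For example, trim_padding("in.wav", "out.wav") would write a copy that is 3 s shorter than the input, where both file names are placeholders.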

Specifications:

ID: King-TTS-162
Size: 13.26 hours
Language: Chinese
Sample rate & bit depth: 48 kHz, 16-bit
Recording environment: Professional recording studio
Speaker: 1 male
Devices: Studio
Accuracy rate:
Proofreading (per word): 99%
Phonetic labeling (per phone): 99.5%
Prosody labeling (per symbol): 98%
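A buyer may want to confirm that delivered files match the stated sample rate and bit depth. The sketch below uses Python's standard wave module and assumes the audio is stored as uncompressed WAV, which the specifications do not state. The names format_problems, EXPECTED_RATE_HZ, and EXPECTED_SAMPLE_WIDTH are hypothetical. The channel count is not checked because the listing does not specify it.

```python
import wave

EXPECTED_RATE_HZ = 48_000    # "48 kHz" from the specifications
EXPECTED_SAMPLE_WIDTH = 2    # 16 bits = 2 bytes per sample


def format_problems(wav_file):
    """Return a list of mismatches between a WAV file and the stated format.

    wav_file may be a file path or a binary file object.
    An empty list means the sample rate and bit depth both match.
    """
    with wave.open(wav_file, "rb") as reader:
        rate = reader.getframerate()
        width = reader.getsampwidth()

    problems = []
    if rate != EXPECTED_RATE_HZ:
        problems.append(
            f"sample rate is {rate} Hz, expected {EXPECTED_RATE_HZ} Hz"
        )
    if width != EXPECTED_SAMPLE_WIDTH:
        problems.append(
            f"bit depth is {8 * width} bits, "
            f"expected {8 * EXPECTED_SAMPLE_WIDTH} bits"
        )
    return problems
```

Running format_problems over every file in a delivery and collecting the non-empty results gives a quick report of files that deviate from the listed format.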

Samples

Audio

King-TTS-162-000017_0066

King-TTS-162-000035_0182

King-TTS-162-000038_0590

King-TTS-162-000038_1605
