Name: Japanese Multi-speaker Speech Synthesis Corpus (Multi-emotion) - DataoceanAI
SKU: King-TTS-023
Availability: InStock

This dataset was recorded by three voice actors.

ID:

King-TTS-023

Size:

9.77 hours

Language:

Japanese

Country

Japan

Sample rate & bit depth

48 kHz,24bit

Recording Environment

Professional recording studio

Gender

Male/Female

Content

News,technology,conversation,entertainment; happy,sad,surprised,angry.

Labeling Process

text,audio,prosody labeling,quality inspection,tone labeling,phonetic labeling

Accuracy Rate

The accuracy rate of phonetic labeling is 99.5%.

Get started

Join our newsletter to stay updated