Japanese 100 Non-professional Speakers Speech Synthesis Corpus - DataoceanAI

Japanese 100 Non-professional Speakers Speech Synthesis Corpus

Content Creation and Entertainment Educational Applications Smart Assistants

This dataset was recorded by 100 speakers with authentic pronunciation and diverse vocal qualities (50 males and 50 females) in a professional recording studio. The recorded texts cover all phonemes, and the annotators have a professional linguistic background, ensuring the data meets the research and development needs for voice synthesis.

Specifications:

ID:

King-TTS-152

Size:

59.64 hours

Language:

Japanese

Country

Japan

Sample rate & bit depth

48 kHz, 16bit

Recording Environment

Professional recording studio

Gender

Male/Female

Content

News, education, film and television, and other fields.

Labeling Process

Text, audio, prosody labeling, quality inspection, phonetic labeling

Accuracy Rate

The accuracy rate of phonetic labeling is 99.5%.

Samples

Audio

People also searched for

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

Female American English Mature Voice

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Female American English Mature Voice

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Rulai Fozu

Chinese Male Voice Character

Get started