This dataset contains 18 hours of recordings from 3 male and 2 female speakers, covering scenarios such as spontaneous dialogue, reading, and mixed Chinese-English reading. All data is precisely labeled for pronunciation, intonation, and paralinguistic features, ensuring high quality and practical value. The topics of this dataset include spontaneous dialogue, words, jokes, riddles, proverbs, tongue twisters, poetry, idioms, and interjections.
English Average Voice Synthesis Corpus – Conversation
Pairs of participants are recorded in the same studio, with each speaker's voice captured in a separate audio file. No text transcriptions are currently available.