This dataset is gender-balanced and covers a variety of voice qualities: mature female, middle-aged male, bass, falsetto, imitation of an elderly voice, etc. All recordings have been precisely annotated, including pronunciation, prosody, and paralinguistic features (stress, elongation). Topics are drawn from casual conversation, such as the origin of names, hobbies, and childhood experiences.
English Average Voice Synthesis Corpus – Conversation
Participants are recorded in pairs in the same studio, with each individual's voice captured in a separate audio file. No text transcriptions are currently available.