The corpus includes over ten languages, such as English, Hindi, Tamil, Telugu, Bengali, Oriya, and Assamese. It features various recording methods, including reading aloud, conversations, and sentence construction, and covers a range of domains such as digital time, shopping travel, medical education, personal and place names, politics, economy, sports, entertainment, and more.
Chinese English & American English Speech Recognition Corpus (desktop)
This dataset includes six topics for an office white-collar meeting scenario: IT and Internet, Finance, Clean Energy, Healthcare, Media, and Consumer Electronics.