India Multilingual Speech Corpus

The corpus covers more than ten languages, including English, Hindi, Tamil, Telugu, Bengali, Oriya, and Assamese. Recordings were collected through several methods, including reading aloud, conversations, and sentence construction, and span domains such as digital time, shopping, travel, medical education, personal and place names, politics, economy, sports, and entertainment.
Specifications:
ID: King-ASR-967
Size: 8645 hours
Language: India
Speakers: 13150

