ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Hungarian Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with the participation of 400 speakers, including 199 males and 201 females. All speakers who took part in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded text materials cover various types of information, including health, music, travel, and social interactions.
Hungarian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the involvement of 600 speakers, comprising 298 males and 302 females. All participants in the recording were carefully selected by professionals to ensure standardized pronunciation and clear enunciation. The recorded texts encompass a variety of information, including news and everyday conversations.
Icelandic Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office/home environments, with the participation of 402 speakers, including 198 males and 204 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover news and other information.
In-Vehicle Noise Corpus
India Maithili Speech Recognition Corpus – (Mobile)
India Multilingual Speech Corpus
The corpus includes over ten languages such as English, Hindi, Tamil, Telugu, Bengali, Oriya, Assamese, and more, featuring various recording methods including reading aloud, conversations, and sentence construction; covering a range of domains such as digital time, shopping travel, medical education, personal and place names, politics, economy, sports, entertainment, and more.
India Nepali Speech Recognition Corpus – (Mobile)
Indian English Speech Recognition Corpus – Conversations (Mobile)
Free dialogue Unfamiliar Participants: Strangers Familiar Participants: Friends, Relatives, Colleagues, etc.
Indian English Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with a total of 200 speakers participating, including 108 males and 92 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, Twitter, and other information.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More