ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Middle American English Speech Recognition Corpus (Desktop+Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 100 speakers participating, including 43 males and 57 females. All speakers involved in the recording were professionally screened to ensure standard pronunciation and clear enunciation. The recorded texts cover information such as news and daily conversations.
Modern Standard Arabic Speech Recognition Corpus – ITN Training Set (Mobile)
Moldova Romanian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with 10 speakers participating, including 5 males and 5 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded texts cover voicemail, paragraph dictation, and other information.
Mongolian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the recording texts covering information on real estate, finance, court proceedings, astronomy, education, and more.
Montenegro Serbian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, involving 10 speakers, comprising 1 male and 9 females. All participants in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover information on voicemail and paragraph dictation.
Moroccan Arabic Speech Recognition Corpus
Morocco Arabic Speech Recognition Corpus ( Phone )
This dataset covers free dialogue content, the topics include news, text messages, car control, music, general, maps, daily oral language, family, health, travel, work, socializing, celebrities, weather, and other common topics in life.
Multi-Language kids Speech Recognition Corpus (Special Device)
This dataset was recorded in a quiet environment, with a total of 140 speakers participating. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers wake-up words and other information.
Multilingual Intelligent Speech Dataset
This dataset covers over 30 scenarios including sports, entertainment, health, shopping, pet, education, food, travel, and so on.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More