ASR

Search our off-the-shelf datasets.

Filter by
Language
Search
Language
Devices
Devices
More
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Multilingual Intelligent Speech Dataset
This dataset covers over 30 scenarios including sports, entertainment, health, shopping, pet, education, food, travel, and so on.
Cantonese Speech Recognition Corpus (Mobile)
This dataset was recorded in noisy environments such as shopping malls, streets, and cars, with a total of 149 speakers participating, including 72 males and 77 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, daily conversations, Twitter, and other information.
Uyghur Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 718 speakers participating, including 327 males and 391 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, daily conversations, Twitter, and other information.
Turkish Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 50 speakers participating, including 26 males and 24 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers family, sports, travel, pets, and other information.
American English Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 252 speakers participating, including 131 males and 121 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, chats, Twitter, and other information.
Korean Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 250 speakers participating, including 118 males and 132 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, daily conversations, Twitter, and other information.
Hokkien Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 249 speakers participating, including 99 males and 150 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, daily conversations, and other information.
Hokkien Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 233 speakers participating, including 104 males and 129 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers family, life, shopping, sports, and other information.
Uyghur Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with a total of 240 speakers participating, including 116 males and 124 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers family, music, friends, life, and other information.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Search
Language
Devices
Devices
More
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More