ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Malay Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 720 speakers participating, including 351 males and 369 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers shopping, education, work, and other information.
Malay Conversational Speech Recognition corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 474 speakers participating, including 226 males and 248 females. All participants involved in the recording were professionally screened to ensure standard pronunciation and clear articulation. The recorded texts cover information on health, sports, education, and other related topics.
Malaysian Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with a total of 200 speakers participating, including 100 males and 100 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers daily conversations, news, and other information.
Malaysian Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office and home environments, with 131 speakers participating, including 65 males and 66 females. All speakers who took part in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover information such as everyday conversations and news updates.
Malaysian Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office and home environments, with the involvement of 200 speakers, comprising 96 males and 104 females. All participants in the recordings were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts encompass information such as news and chat conversations.
Maltese Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the participation of 203 speakers, comprising 87 males and 116 females. All speakers who took part in the recording were professionally screened to ensure standardized pronunciation and clear enunciation. The recorded texts cover information on news, daily conversations, and other topics.
Mandarin Chinese Duplex Dialogue Corpus (Desktop)
This dataset contains topics such as casual conversations / business meetings (home, workplace, weekly meetings, regular meetings)
Mandarin Chinese Duplex Dialogue Corpus (Mobile)
It includes a variety of scenarios such as daily casual conversations, AI, and new energy.
Mandarin Chinese Speech Recognition Corpus – ITN Training Set (Mobile)

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More