ASR

Search our off-the-shelf datasets.

Latin American Spanish Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 60 speakers participating, including 31 males and 29 females. All speakers who took part in the recording were professionally screened to ensure standardized pronunciation and clear enunciation. The recorded texts cover information on everyday conversations and other related topics.
Latvian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with the participation of 200 speakers, including 110 males and 90 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded texts cover a range of information, including news and daily conversations.
Libyan Arabic Speech Recognition Corpus – Dialogue (Mobile)
This Libyan Arabic conversational speech dataset was collected on Android and iOS devices. The corpus contains 50 spontaneous conversations between pairs of speakers, 100 speakers in total. For each conversation, the two speakers recorded in separate quiet rooms. The dataset covers 21 topics. The audio duration is about 118.5 hours and the pure recording time is about 56.0 hours, including reasonable leading and trailing silence. The total size of the dataset is 12.7 GB.
Lithuanian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the involvement of 201 speakers, comprising 99 males and 102 females. All participants in the recording were carefully selected by professionals to ensure standardized pronunciation and clear enunciation. The recorded texts span various types of information, including news and everyday conversations.
Luxembourg German Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with 10 speakers participating, including 5 males and 5 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded texts cover voicemail, paragraph dictation, and other information.
Macedonian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the participation of 402 speakers, including 185 males and 217 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover news and everyday conversations.
Malay Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 720 speakers participating, including 351 males and 369 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers shopping, education, work, and other information.
Malay Conversational Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 474 speakers participating, including 226 males and 248 females. All participants involved in the recording were professionally screened to ensure standard pronunciation and clear articulation. The recorded texts cover information on health, sports, education, and other related topics.
Malaysian Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with a total of 200 speakers participating, including 100 males and 100 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers daily conversations, news, and other information.
