ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Korean Speech Recognition corpus (Mobile)
This dataset was recorded in both quiet and noisy environments, with a total of 310 speakers participating, including 160 males and 150 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers numbers, dates, times, personal names, and other information.
Korean Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 250 speakers participating, including 118 males and 132 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, daily conversations, Twitter, and other information.
Laos Lao Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office/home environments, with the participation of 400 speakers, including 204 males and 196 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover news and other information.
Latin American Spanish Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 60 speakers participating, including 31 males and 29 females. All speakers who took part in the recording were professionally screened to ensure standardized pronunciation and clear enunciation. The recorded texts cover information on everyday conversations and other related topics.
Latvian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with the participation of 200 speakers, including 110 males and 90 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded texts cover a range of information, including news and daily conversations.
Libyan Arabic Speech Recognition Corpus – Dialogue (Mobile)
This is Libyan Arabic conversational speech dataset ,which is collected over Android and iOS devices. The corpus contains 50 pairs of Libyan spontaneous conversational speech, which were from 100 speakers. For this collection, 2 speakers of each group performed the recording in separate quiet rooms. 21 topics were contained in this dataset. The audio duration is about 118.5 hours and the pure recording time is about 56.0 hours, including the reasonable leading and trailing silence. The total size of this dataset is 12.7 G.
Lithuanian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the involvement of 201 speakers, comprising 99 males and 102 females. All participants in the recording were carefully selected by professionals to ensure standardized pronunciation and clear enunciation. The recorded texts span various types of information, including news and everyday conversations.
Luxembourg German Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with 10 speakers participating, including 5 males and 5 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded texts cover voicemail, paragraph dictation, and other information.
Macedonian Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the participation of 402 speakers, including 185 males and 217 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover news and everyday conversations.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More