ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
People from Multi-Country Speak English Corpus
This corpus comprises recordings from 35,628 speakers with each speaker contributing between 10 to 60 minutes of speech. The gender distribution is approximately equal. The age range of the speakers spans from 7 to 80 years old. It includes a diverse array of accents, representing 64 countries including China, the United States, the United Kingdom, Canada, Australia, Japan, South Korea, and many others.
People from Multi-Country Speak Spanish Corpus
The corpus includes accents from Chile, Mexico, Spain, the United States, and several other countries, with recordings made using various devices such as desktops, Android phones, iOS phones, and more, captured in common environments like quiet indoor spaces, entertainment venues, cars, streets, and restaurants.
Polish American English Speech Recognition Corpus (Desktop+Mobile)
This dataset was recorded in a quiet office/home environment, with 100 speakers participating, including 49 males and 51 females. All participants in the recording were professionally selected to ensure standardized pronunciation and clear articulation. The recorded texts span topics such as news and everyday conversations.
Polish Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with a total of 200 speakers participating, including 98 males and 102 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers daily conversations, news, and other information.
Portugal English Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office/home environments, with a total of 201 speakers participating, including 90 males and 111 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers information such as news and daily conversations.
Portuguese Conversational Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet home environment, with the participation of 215 speakers, including 107 males and 108 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts span a range of topics, including information on computers, marriage, sports, and travel.
Portuguese Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with 200 speakers participating, including 89 males and 111 females. All speakers involved in the recording were carefully selected by professionals to ensure standardized pronunciation and clear enunciation. The recorded texts cover information such as news updates and everyday conversations.
Portuguese Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with the involvement of 50 speakers, comprising 26 males and 24 females. All participants in the recording were meticulously selected by professionals to ensure standardized pronunciation and articulate speech. The recorded texts encompass information such as numbers, dates, times, and personal names.
Portuguese Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet office environment, with 205 speakers participating, including 103 males and 102 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover information such as news updates and everyday conversations.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More