ASR

Search our off-the-shelf datasets.

Ten Thousand People Speak Chinese Corpus
Reading and conversation data. Topics: news, text messages, car control, number sequences, music, general, maps, daily colloquial speech, family, health, travel, work, socializing, celebrities, weather, and other common life topics. Read text: 10,051 people, 3,953 hours (no less than 1 minute per person, no less than 4 characters per sentence). Free conversation: 3,844 people, 1,914 hours (long audio).
Ten Thousand People Speak Chinese Dialect Corpus
This dataset covers 29,954 dialect speakers from 26 provinces in China, aged 12 to 75, with a balanced gender ratio. Total recording time is 34,073 hours, with an average recording duration of nearly 60 minutes. Topics include news, text messages, vehicle control, music, general, maps, daily colloquial speech, family, health, travel, work, socializing, celebrities, weather, and other common life topics.
Thai Speech Recognition Corpus
The topics cover 11 industries: construction, education, environment, finance, food and beverage, manufacturing, medical, retail, service, technology, and travel.
Thai Speech Recognition Corpus (Desktop)
This dataset was recorded in quiet office and home environments, with 205 speakers participating, including 101 males and 104 females. All speakers who took part in the recording were professionally screened to ensure standardized pronunciation and clear articulation. The recorded texts cover information such as everyday conversations and news updates.
Thai Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office and home environments, with 205 speakers participating, including 101 males and 104 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover information such as news updates and everyday conversations.
Turkish Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with the involvement of 400 speakers, comprising 221 males and 179 females. All participants in the recording were professionally screened to ensure standard pronunciation and clear enunciation. The recorded texts cover a range of topics, including health, sports, friendships, and lifestyle.
Turkish Conversational Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 50 speakers participating, including 26 males and 24 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers family, sports, travel, pets, and other information.
Turkish Speech Recognition Corpus (Desktop)
This dataset was recorded in a quiet environment, with 201 speakers participating, including 104 males and 97 females. All speakers involved in the recording were carefully selected by professionals to ensure standardized pronunciation and clear articulation. The recorded texts cover information such as news updates and everyday conversations.
Turkish Speech Recognition Corpus (Incar)
This dataset was recorded in a vehicular noise environment, with the participation of 316 speakers, including 156 males and 160 females. All speakers involved in the recording were carefully selected by professionals to ensure standardized pronunciation and clear articulation. The recorded texts cover information such as numbers, dates, times, and personal names.
