ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Free dialogue in Odia Speech Corpus
【Product Type】Odia language from India, free conversation, mobile 16K 【Corpus Type】 Home, health, travel, education, work, gourmet food, marriage, movies, music, socializing, celebrities, weather, sports, and other common topics in daily life Natural context, applicable to the entire industry 【Pronouncer Information】 Gender: Male 44%, Female 56% Age: Pronouncers mainly cover the age range of 16-45 Accent: Pronouncers are from Odisha state.
Ten Thousand People Dialect with High-Quality Labeling Speech Corpus
This dataset covers 29,954 dialect speakers from 26 provinces in China, ranging in age from 12 to 75, with a total recording time of 34,073 hours and an average recording duration of nearly 60 minutes, maintaining a balanced gender ratio. The topics covered are very extensive, including news, text messages, vehicle control, music, general, maps, daily colloquial speech, family, health, travel, work, socializing, celebrities, weather, and other common life topics.
Chinese dialect Speech Corpus
【Pronunciation Speaker Information】 Gender: The ratio of male to female pronunciation speakers is approximately 1:1. Age: Pronunciation speakers cover the age range from 16 to 60 years old. Accents: Fujian, Guangdong, Hunan, Jiangxi, Wu (Suzhou), Yunnan, Guizhou, Wu (Shanghai), Tianjin, Anhui, Shandong, Henan, Liaoning (Shenyang/Anshan), Shaanxi, Shanxi, Hubei, Gansu, Wenzhou, Hebei, Liaoning (Dalian/Dandong), Wu (Zhejiang), Sichuan.
In-Vehicle Noise Corpus
Chinese Mandarin Speech Recognition Corpus
【Product Features】High sampling rate, in-vehicle corpus, indoor quiet collection, multiple scenarios (vehicle control, music, general, map, casual chat scenarios) Can be applied to in-vehicle and other common speech recognition scenarios. 【Audio Parameters】 16k: 1 person 0.5 hours 44.1k: 148 people 74.9 hours 48k:2463 people,1354.8 hours
Chinese Speech Corpus-Incabin
【Product Type】Chinese, Reading, Desktop Collection (16K) 【Product Features】Collected in-vehicle, various types of corpora (vehicle control, music, general, maps, casual conversation scenarios), over 100 recording scenarios. Applicable to the automotive field. 【Pronunciation Person Information】 Gender: Male 49%, Female 51% Age: Pronunciation people cover the age range of 15-60 years old, with approximately 10% over the age of 45. Accent: Equally distributed across the Chinese seven major accent regions.
Chinese-English Mixed Speech Recognition Corpus (Desktop)
【Product Features】 High sampling rate (44.1/48K), in-vehicle corpus, collected in a quiet indoor environment, multiple scenarios (vehicle control, music, general, maps, casual conversation, English interaction, audiobooks, etc.) Applicable to in-car and other common voice recognition scenarios.
Albanian Free Dialogue Speech Corpus
【Corpus Type】 Family, health, travel, education, work, gourmet food, marriage, movies, music, socializing, celebrities, weather, sports, and other common topics of daily life. Natural context, applicable to all industries. 【Pronunciation Person Information】 Gender: Male 45%, Female 55% Age: The pronunciation people mainly cover the age range of 16-45. Accent: Speakers are from Tirana.
Ethiopian Amharic Free Dialogue Speech Corpus
【Product Type】 Ethiopian Amharic language, free dialogue, mobile 16K 【Corpus Type】 Family, health, travel, education, work, cuisine, marriage, movies, music, socializing, celebrities, weather, sports, and other common topics of daily life. Natural context, applicable to all industries. 【Pronunciation Person Information】 Gender: Male 50%, Female 50% Age: The pronunciation people mainly cover the age range of 16-45. Accent: The pronunciation people mainly come from central Ethiopia.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More