Chinese-English Mixed Speech Recognition Corpus

This dataset is designed to support the development and refinement of bilingual Chinese-English mixed speech recognition technologies. It contains a diverse set of speech samples recorded in various scenarios to train and test speech recognition systems. Recordings were primarily made using desktop devices to mimic everyday usage environments. All recordings are made using high-fidelity equipment to ensure clarity and improve the accuracy of speech recognition systems. Recordings were conducted in noise-free or low-noise environments to minimize the impact of background noise on speech recognition performance. Speakers are selected from the seven major Chinese dialect regions to achieve a balanced representation of regional accents.
Specifications:
ID:
King-ASR-951
Size:
2000 hours
Language:
English, Chinese
Devices:
Desktop

People also searched for

Free dialogue in Odia Speech Corpus
【Product Type】Odia language from India, free conversation, mobile 16K 【Corpus Type】 Home, health, travel, education, work, gourmet food, marriage, movies, music, socializing, celebrities, weather, sports, and other common topics in daily life Natural context, applicable to the entire industry 【Pronouncer Information】 Gender: Male 44%, Female 56% Age: Pronouncers mainly cover the age range of 16-45 Accent: Pronouncers are from Odisha state.
Ten Thousand People Dialect with High-Quality Labeling Speech Corpus
This dataset covers 29,954 dialect speakers from 26 provinces in China, ranging in age from 12 to 75, with a total recording time of 34,073 hours and an average recording duration of nearly 60 minutes, maintaining a balanced gender ratio. The topics covered are very extensive, including news, text messages, vehicle control, music, general, maps, daily colloquial speech, family, health, travel, work, socializing, celebrities, weather, and other common life topics.
Chinese dialect Speech Corpus
【Pronunciation Speaker Information】 Gender: The ratio of male to female pronunciation speakers is approximately 1:1. Age: Pronunciation speakers cover the age range from 16 to 60 years old. Accents: Fujian, Guangdong, Hunan, Jiangxi, Wu (Suzhou), Yunnan, Guizhou, Wu (Shanghai), Tianjin, Anhui, Shandong, Henan, Liaoning (Shenyang/Anshan), Shaanxi, Shanxi, Hubei, Gansu, Wenzhou, Hebei, Liaoning (Dalian/Dandong), Wu (Zhejiang), Sichuan.
In-Vehicle Noise Corpus

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.