Chinese-English Mixed Speech Recognition Corpus

This dataset is designed to support the development and refinement of bilingual Chinese-English mixed speech recognition technologies. It contains a diverse set of speech samples recorded in various scenarios to train and test speech recognition systems. Recordings were primarily made using desktop devices to mimic everyday usage environments. All recordings are made using high-fidelity equipment to ensure clarity and improve the accuracy of speech recognition systems. Recordings were conducted in noise-free or low-noise environments to minimize the impact of background noise on speech recognition performance. Speakers are selected from the seven major Chinese dialect regions to achieve a balanced representation of regional accents.
Specifications:
ID:
King-ASR-951
Size:
2000 hours
Language:
English, Chinese
Devices:
Desktop

People also searched for

Thai Speech Recognition Corpus
The topic covers 11 industry including: construction, education, environment, finance, food and beverage, manufacturing, medical, retail, service, technology, travel.
Indonesian Speech Recognition Corpus
Family, Health, Music, Shopping, Sports, Travel, Work, Food, Education, Movies, Social Networking, Friends, Entertainment, News, Pets, Computers, Television, Celebrities, Life, Marriage, Weather
Chinese English & American English Speech Recognition Corpus-(PC/Pad)
Chinese English & American English Speech Recognition Corpus (desktop)
This dataset includes 6 topic for Office White-collar Meeting Scenario - IT and Internet, Finance, Clear Energy, Healthcare, Media and Consumer Electronics.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.