ASR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Hokkien Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet office/home environment, with a total of 249 speakers participating, including 99 males and 150 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, daily conversations, and other information.
Hong Kong Cantonese Conversational Speech Recognition Corpus (Telephone)
This dataset was recorded in a quiet home environment, with the participation of 180 speakers, including 84 males and 96 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover entertainment, food, marriage, and sports.
Hong Kong Cantonese Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 51 speakers participating, including 25 males and 26 females. All speakers who took part in the recording were professionally screened to ensure standard pronunciation and clear enunciation. The recorded texts cover information from news, everyday conversations, Twitter, and other similar content.
Hong Kong Cantonese Speech Recognition Corpus (Mobile)
This dataset was recorded in quiet office and home environments, with the participation of 579 speakers, including 271 males and 308 females. All speakers involved in the recordings were carefully selected by professionals to ensure standardized pronunciation and clear articulation. The recorded texts encompass information such as news updates and everyday chats.
Hong Kong Cantonese Speech Recognition Corpus (Smart Switch)
This dataset was recorded in noisy environments, with the participation of 76 speakers, including 29 males and 47 females. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover information on greeting phrases such as "Hello Mao Mao" and wake-up words.
Hong Kong Cantonese-English Mixed Corpus
Daily Conversation Scenarios: Including commonly used English words, abbreviations, names, software, trademarks, shop names, etc. in Cantonese
Hong Kong English Business Meeting Recognition Voice Library – Conversations (Mobile)
Business Meeting Conversations Topic: Finance, Healthcare, R&D, Internet... Group Size: 2-4 People per Group
Hong Kong English Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office/home environment, with a total of 200 speakers participating, including 99 males and 101 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers news, forums, SMS, Twitter, and other information.
Hong Kong English Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet office environment, with the participation of 212 speakers, including 90 males and 122 females. All speakers involved in the recording were professionally screened to ensure standard pronunciation and clear articulation. The recorded texts cover various types of information, such as news and everyday conversations.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More