NLP

Search our off-the-shelf datasets.

Regularization of non-Chinese characters
Russian TN Corpus
SC and TC Chinese Pinyin Corpus
Simplified Chinese Email Corpus with Signature Entity labeling
Simplified Chinese TN Corpus
Smart Device Application Scenario Corpus
Spain Email Corpus
Spanish TN Corpus
Collected from news and chat corpora, with tagging and classification of non-Spanish characters.
Text corpora in Shanghainese
