NLP

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Chinese Surname Polyphone Corpus
Chinese Text Messages Corpus
This dataset provides a specification of the contents of the Chinese SMS Corpus
Chinese Text Messages Corpus – Entity labeling
This dataset provides a specification of the contents of the Chinese SMS Corpus
Chinese Text Messages Corpus – Pronounce Labeling
This dataset provides a specification of the contents of the Chinese Email Corpus
Chinese Text Messages Corpus – Word segmentation labeling
This dataset provides a specification of the contents of the Chinese SMS Corpus
Chinese Text Messages Corpus (Include sender and time)
This dataset provides a specification of the contents of the Chinese SMS Corpus
Chinese-English Parallel Corpus
Daily data in Chinese and English, parallel corpus dataset
Chinese-English Prosody labeling Corpus
Competition-level Mathematics, Physics Reasoning Corpus
This dataset is for AI models to train to learn to extract critical information from problem statements and methodically derive solutions. This type of dataset proves particularly valuable for developing automated question-answering systems and AI applications requiring sophisticated reasoning capabilities.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More