NLP

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Indonesian TN Corpus
Collecting from news & chat corpus, performing tagging and classification of non-Indonesian characters.
Italian TN Corpus
Italy Email Corpus
Itinerary Conversation Corpus
Japanese Phonological Corpus
Japanese SMS Corpus with POS and NER
Collecting from news or daily chat corpus, and performing word segmentation, POS and NER.
Japanese TN Corpus
K12 (Primary/Junior/Senior High) Testing Questions Across all Subjects
This dataset is for AI models to train to learn to extract critical information from problem statements and methodically derive solutions. This type of dataset proves particularly valuable for developing automated question-answering systems and AI applications requiring sophisticated reasoning capabilities

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More