K12, University, and Graduate-Level Professional Subject Q&A Corpus

This dataset consisted of millions of high-quality Chain-of-Thought (CoT) questions are derived from authoritative sources and include questions and answers. These datasets undergo rigorous processing steps such as question screening, entry, duplicate checking, solving, review, and proofreading, followed by strict quality control to form standardized question banks.
Specifications:
ID:
King-NLP-013
Size:
Over 10 million sets
Language:
English, Chinese
Quantity
12 million K12 (Primary/Junior/Senior High School) full-subject Chinese Q&A
200,000 university-level Mathematics, Physics, Chemistry, and Computer Science Chinese Q&A.
500,000 university professional course (Business, Law, Medicine, etc.) Chinese Q&A
50,000 STEM competition (Mathematics, Physics, etc.) Chinese/English Bilingual Q&A

People also searched for

Competition-level Mathematics, Physics Reasoning Corpus
This dataset is for AI models to train to learn to extract critical information from problem statements and methodically derive solutions. This type of dataset proves particularly valuable for developing automated question-answering systems and AI applications requiring sophisticated reasoning capabilities.
University-level Business, Law, Medicine Reasoning Corpus
This dataset is for AI models to train to learn to extract critical information from problem statements and methodically derive solutions. This type of dataset proves particularly valuable for developing automated question-answering systems and AI applications requiring sophisticated reasoning capabilities.
University-level Mathematics, Physics, Chemistry, Computer Science Reasoning Corpus
This dataset is for AI models to train to learn to extract critical information from problem statements and methodically derive solutions. This type of dataset proves particularly valuable for developing automated question-answering systems and AI applications requiring sophisticated reasoning capabilities.
K12 (Primary/Junior/Senior High) Testing Questions Across all Subjects
This dataset is for AI models to train to learn to extract critical information from problem statements and methodically derive solutions. This type of dataset proves particularly valuable for developing automated question-answering systems and AI applications requiring sophisticated reasoning capabilities

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.