K12, University, and Graduate-Level Professional Subject Q&A Corpus

This dataset consisted of millions of high-quality Chain-of-Thought (CoT) questions are derived from authoritative sources and include questions and answers. These datasets undergo rigorous processing steps such as question screening, entry, duplicate checking, solving, review, and proofreading, followed by strict quality control to form standardized question banks.
Specifications:
ID:
King-NLP-013
Size:
Over 10 million sets
Language:
English, Chinese
Quantity
12 million K12 (Primary/Junior/Senior High School) full-subject Chinese Q&A
200,000 university-level Mathematics, Physics, Chemistry, and Computer Science Chinese Q&A.
500,000 university professional course (Business, Law, Medicine, etc.) Chinese Q&A
50,000 STEM competition (Mathematics, Physics, etc.) Chinese/English Bilingual Q&A

People also searched for

Tamil Text Normalization Corpus
Bengali Text Normalization Corpus
Swahili Text Normalization Corpus
Somali Text Normalization Corpus

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.