Chinese OCR Corpus

This Chinese dataset consists of 26 categories, with a total of 4,408 printed images and 510 handwritten images, covering the most common scenarios in daily life. All data is labeled.
Specifications:
ID: King-OCR-039
Language: Chinese
Data size: 4,918 images
Data format: .jpg, .jpeg, .png
Data content: PPT slides, documents, natural-light photographs, screenshots, and handwriting
Labeling content: Line-level bounding boxes and transcriptions of the text
Accuracy rate: Labeling accuracy of 97%
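
As a quick sanity check after obtaining the data, the stated size and formats can be verified with a short script. The following is a minimal sketch in Python, not part of the dataset's own tooling: it assumes the images sit somewhere under a single local directory, and the function names `count_images` and `matches_expected_total` are hypothetical. The actual delivery layout and annotation file format are not described here, so the sketch only counts image files.

```python
from collections import Counter
from pathlib import Path

# Extensions listed in the specification's data format field.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}

# Counts stated in the description: 4,408 printed + 510 handwritten images.
EXPECTED_TOTAL = 4408 + 510  # 4918, matching the stated data size


def count_images(root):
    """Count image files under root (recursively), keyed by lowercase extension."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
            counts[path.suffix.lower()] += 1
    return counts


def matches_expected_total(counts, expected=EXPECTED_TOTAL):
    """Return True when the number of images found equals the expected total."""
    return sum(counts.values()) == expected
```

Calling `count_images` on the dataset directory (the path is whatever the data was unpacked to) and passing the result to `matches_expected_total` shows whether all 4,918 images are present.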

People also searched for

Korean Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 6788 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Korea, and all the images in the dataset include labeling results.
Hindi Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 21218 printed images, covering most commonly encountered scenarios in daily life. The data was collected in India, and all the images in the dataset include labeling results.
Thai Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 13882 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Thailand, and all the images in the dataset include labeling results.
Vietnamese Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 14015 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Vietnam, and all the images in the dataset include labeling results.
