English OCR Corpus

The english dataset consists of 21 categories, total of 2,637 printed images and 406 handwritten images, covering most commonly used scenarios in daily life, with all data labeled.
Specifications:
ID:
King-OCR-040
Language:
English
Data size
3043 pics
Data format
.jpg/.jpeg/.png
Data content
Including PPT type, document type, natural light photography, screenshots, and handwriting type.
Labeling Content
Line-level bounding box labeling and transcription for the texts
Devices:
Mobile
Accuracy Rate
The accuracy of the labeling results is 97%

People also searched for

Korean Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 6788 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Korea, and all the images in the dataset include labeling results.
Hindi Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 21218 printed images, covering most commonly encountered scenarios in daily life. The data was collected in India, and all the images in the dataset include labeling results.
Thai Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 13882 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Thailand, and all the images in the dataset include labeling results.
Vietnamese Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 14015 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Vietnam, and all the images in the dataset include labeling results.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.