OCR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Czech OCR Image Corpus
This dataset consists of 11 categories and a total of 1135 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Czech Republic, and all the images in the dataset include labeling results.
English OCR Corpus
The english dataset consists of 21 categories, total of 2,637 printed images and 406 handwritten images, covering most commonly used scenarios in daily life, with all data labeled.
English OCR Image Corpus (multi-angles)
This dataset consists of english menu and road sign data, each menu/road sign photographed from 5 angles, total of 14,350 sets of menus, 6,029 sets of road signs, total of 101,895 images.
English OCR Image Corpus (one angle)
This dataset consists of english menus, street signs, taken in the United States, total of 6,050 images, all labeled.
French Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 11202 printed images, covering most commonly encountered scenarios in daily life. The data was collected in France, and all the images in the dataset include labeling results.
French OCR Image Corpus
This dataset consists of 11 categories and a total of 1000 printed images, covering most commonly encountered scenarios in daily life. The data was collected in France, and all the images in the dataset include labeling results.
Georgian OCR Image Corpus
This dataset consists of 11 categories and a total of 2000 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Georgia, and all the images in the dataset include labeling results.
German Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 12981 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Germany, and all the images in the dataset include labeling results.
German OCR Image Corpus
This dataset consists of 11 categories and a total of 1003 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Germany, and all the images in the dataset include labeling results.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More