OCR

Search our off-the-shelf datasets.

Georgian OCR Image Corpus
This dataset consists of 11 categories and a total of 2000 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Georgia, and all the images in the dataset include labeling results.
German Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 12981 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Germany, and all the images in the dataset include labeling results.
German OCR Image Corpus
This dataset consists of 11 categories and a total of 1003 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Germany, and all the images in the dataset include labeling results.
Greek OCR Image Corpus
This dataset consists of 11 categories and a total of 1005 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Greece, and all the images in the dataset include labeling results.
Handwritten Formula OCR Corpus
Data type: handwritten arithmetic expressions from primary, middle, and high school math, physics, and chemistry. The content was collected from workbooks, exercise books, and mistake notebooks.
Hindi Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 21218 printed images, covering most commonly encountered scenarios in daily life. The data was collected in India, and all the images in the dataset include labeling results.
Hungarian OCR Image Corpus
This dataset consists of 11 categories and a total of 1044 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Hungary, and all the images in the dataset include labeling results.
Italian Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 10714 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Italy, and all the images in the dataset include labeling results.
Japanese Handwriting OCR Corpus
This dataset provides Japanese handwriting data from 130 contributors. The handwritten pages were scanned into images, and the text in each image is marked with a rectangular bounding box.
