OCR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Greek OCR Image Corpus
This dataset consists of 11 categories and a total of 1005 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Greece, and all the images in the dataset include labeling results.
Hindi Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 21218 printed images, covering most commonly encountered scenarios in daily life. The data was collected in India, and all the images in the dataset include labeling results.
Hungarian OCR Image Corpus
This dataset consists of 11 categories and a total of 1044 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Hungary, and all the images in the dataset include labeling results.
Italian Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 10714 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Italia, and all the images in the dataset include labeling results.
Japanese Handwriting OCR Corpus
This document provides Japanese handwritten data of 130 collectors. We scan the handwritten data into a picture, and mark the text in the picture with a rectangular box.
Japanese OCR Image Corpus
This dataset consists of 11 categories and a total of 1002 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Japan, and all the images in the dataset include labeling results.
Japanese OCR Image Corpus (one angle)
This dataset consists of japanese dataset, covering multiple categories, taken in Japan, total of 1,066 images.
Korean Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 6788 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Korea, and all the images in the dataset include labeling results.
Mongolian OCR Image Corpus
This dataset consists of 9 categories and a total of 1001 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Inner Mongolia, and all the images in the dataset include labeling results.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More