OCR

Search our off-the-shelf datasets.

Japanese OCR Image Corpus
This dataset consists of 11 categories and a total of 1,002 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Japan, and all the images in the dataset include labeling results.
Japanese OCR Image Corpus (one angle)
This dataset contains a total of 1,066 images of Japanese text covering multiple categories, all taken in Japan.
Korean Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 6,788 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Korea, and all the images in the dataset include labeling results.
Mongolian OCR Image Corpus
This dataset consists of 9 categories and a total of 1,001 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Inner Mongolia, and all the images in the dataset include labeling results.
Natural Scene and Document OCR Image Corpus (10 Countries)
This dataset covers 10 languages (French, German, Italian, Spanish, Portuguese, Japanese, Korean, Russian, Chinese, and English) across natural scene and document categories, for a total of 44,821 images.
Portuguese Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 15,385 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Portugal, and all the images in the dataset include labeling results.
Portuguese OCR Image Corpus
This dataset consists of 11 categories and a total of 1,013 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Portugal, and all the images in the dataset include labeling results.
Russian Handwritten Checklist Corpus
Data type: Handwritten content (including notes, tables, etc.) and blackboard writing
Russian OCR Image Corpus
This dataset consists of 11 categories and a total of 1,000 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Russia, and all the images in the dataset include labeling results.
