OCR

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Natural Scene and Document OCR Image Corpus(10 Countries)
This dataset consists of french, german, italian, spanish, portuguese, japanese, korean, russian, chinese, and english datasets, total of 10 languages of natural scenes and document categories, total of 44,821 images.
Portuguese Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 15385 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Portugal, and all the images in the dataset include labeling results.
Portuguese OCR Image Corpus
This dataset consists of 11 categories and a total of 1013 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Portugal, and all the images in the dataset include labeling results.
Russian OCR Image Corpus
This dataset consists of 11 categories and a total of 1000 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Russia, and all the images in the dataset include labeling results.
Slovak OCR Image Corpus
This dataset consists of 11 categories and a total of 1110 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Slovakia, and all the images in the dataset include labeling results.
Spanish Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 15126 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Spain, and all the images in the dataset include labeling results
Spanish OCR Image Corpus
This dataset consists of 10 categories and a total of 1061 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Spain, and all the images in the dataset include labeling results.
Tamil OCR Image Corpus
This dataset consists of 11 categories and a total of 1493 printed images, covering most commonly encountered scenarios in daily life. The data was collected in India, and all the images in the dataset include labeling results.
Thai Natural Scene OCR Image Corpus
This dataset consists of 9 categories and a total of 13882 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Thailand, and all the images in the dataset include labeling results.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More