OCR

Georgian OCR Image Corpus

This dataset consists of 11 categories and a total of 2000 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Georgia, and all the images in the dataset include labeling results.

Document Digitization and Archiving Multilingual Support Tourist Guides

German Natural Scene OCR Image Corpus

This dataset consists of 9 categories and a total of 12981 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Germany, and all the images in the dataset include labeling results.

Multilingual Support Tourist Guides Urban Management

German OCR Image Corpus

This dataset consists of 11 categories and a total of 1003 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Germany, and all the images in the dataset include labeling results.

Document Digitization and Archiving Multilingual Support Tourist Guides

Greek OCR Image Corpus

This dataset consists of 11 categories and a total of 1005 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Greece, and all the images in the dataset include labeling results.

Document Digitization and Archiving Multilingual Support Tourist Guides

Handwritten Formula OCR Corpus

Data Type: Primary, Middle, and High School Math, Physics, and Chemistry Arithmetic Expressions (Collected content includes workbooks, exercise books, and mistake notebooks)

OCR Handwritten Formula

Hindi Natural Scene OCR Image Corpus

This dataset consists of 8 categories and a total of 21218 printed images, covering most commonly encountered scenarios in daily life. The data was collected in India, and all the images in the dataset include labeling results.

Multilingual Support Tourist Guides Urban Management

Hungarian OCR Image Corpus

This dataset consists of 11 categories and a total of 1044 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Hungary, and all the images in the dataset include labeling results.

Document Digitization and Archiving Multilingual Support Tourist Guides

Italian Natural Scene OCR Image Corpus

This dataset consists of 9 categories and a total of 10714 printed images, covering most commonly encountered scenarios in daily life. The data was collected in Italia, and all the images in the dataset include labeling results.

Multilingual Support Tourist Guides Urban Management

Japanese Handwriting OCR Corpus

This document provides Japanese handwritten data of 130 collectors. We scan the handwritten data into a picture, and mark the text in the picture with a rectangular box.

Cultural Research Data Entry Automation Document Digitization and Archiving

Filter by

Georgian OCR Image Corpus

German Natural Scene OCR Image Corpus

German OCR Image Corpus

Greek OCR Image Corpus

Handwritten Formula OCR Corpus

Hindi Natural Scene OCR Image Corpus

Hungarian OCR Image Corpus

Italian Natural Scene OCR Image Corpus

Japanese Handwriting OCR Corpus

Get started

Filter by

Filter by

OCR

Filter by

Georgian OCR Image Corpus

German Natural Scene OCR Image Corpus

German OCR Image Corpus

Greek OCR Image Corpus

Handwritten Formula OCR Corpus

Hindi Natural Scene OCR Image Corpus

Hungarian OCR Image Corpus

Italian Natural Scene OCR Image Corpus

Japanese Handwriting OCR Corpus

Get started

Join our newsletter to stay updated

Filter by

Filter by