This dataset covers 10 languages (French, German, Italian, Spanish, Portuguese, Japanese, Korean, Russian, Chinese, and English) and contains a total of 44,821 images of natural scenes and documents.
English, Chinese, French, Portuguese, Korean, Japanese, Italian, Russian, Spanish, German
Data size
44,821 images
Data format
.jpg/.jpeg/.png
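As a quick sanity check on a downloaded copy, the image files can be tallied by extension and compared against the stated total. The following is a minimal sketch, not part of the dataset's tooling: the directory name `ocr_dataset` and the helper `count_images` are hypothetical, and the actual folder layout of the delivery is not described here.

```python
from collections import Counter
from pathlib import Path

# Extensions listed under "Data format"; matching is case-insensitive.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def count_images(root):
    """Count image files under `root` (recursively), grouped by lowercase extension."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        suffix = path.suffix.lower()
        if path.is_file() and suffix in IMAGE_EXTENSIONS:
            counts[suffix] += 1
    return counts


if __name__ == "__main__":
    # "ocr_dataset" is a placeholder path (assumption), not the real delivery layout.
    counts = count_images("ocr_dataset")
    print(dict(counts), "total:", sum(counts.values()))
```

If the total printed differs from the stated image count, the copy may be incomplete or may contain files in formats other than the three listed.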
Data content
ProductLabel, Menu, Ticket, Map, StoreName, AdvertisementSign, Flyer, Poster, Banner, BusinessCard, Receipt, BulletinBoard, StreetSign, Book, Magazine, Newspaper and Form
Labeling Content
Line-level bounding box labeling and transcription of the text
Devices
Mobile
Accuracy Rate
The accuracy of the labeling results is 97%
People also searched for
Korean Natural Scene OCR Image Corpus
This dataset consists of 8 categories and a total of 6,788 printed images, covering the scenarios most commonly encountered in daily life. The data was collected in Korea, and all the images in the dataset include labeling results.
This dataset consists of 8 categories and a total of 21,218 printed images, covering the scenarios most commonly encountered in daily life. The data was collected in India, and all the images in the dataset include labeling results.
This dataset consists of 9 categories and a total of 13,882 printed images, covering the scenarios most commonly encountered in daily life. The data was collected in Thailand, and all the images in the dataset include labeling results.
This dataset consists of 9 categories and a total of 14,015 printed images, covering the scenarios most commonly encountered in daily life. The data was collected in Vietnam, and all the images in the dataset include labeling results.