Natural Scene and Document OCR Image Corpus(10 Countries)

This dataset consists of french, german, italian, spanish, portuguese, japanese, korean, russian, chinese, and english datasets, total of 10 languages of natural scenes and document categories, total of 44,821 images.
Specifications:
ID:
King-OCR-007
Language:
English, Chinese, French, Portuguese, Korean, Japanese, Italian, Russian, Spanish, German
Data size
44821 pics
Data format
.jpg/.jpeg/.png
Data content
ProductLabel, Menu, Ticket, Map, StoreName, AdvertisementSign, Flyer, Poster, Banner, BusinessCard, Receipt, BulletinBoard, StreetSign, Book, Magazine, Newspaper and Form
Labeling Content
Line-level bounding box labeling and transcription for the texts
Devices:
Mobile
Accuracy Rate
The accuracy of the labeling results is 97%

People also searched for

Ukrainian Handwritten Checklist Corpus
Data type: Handwritten content (including notes, tables, etc.) and blackboard writing
Russian Handwritten Checklist Corpus
Data type: Handwritten content (including notes, tables, etc.) and blackboard writing
Traditional Chinese Handwritten Checklist Corpus
Data type: Handwritten content (including notes, tables, etc.) and blackboard writing
Simplified Chinese Handwritten Checklist Corpus
Data Type: Handwritten content (including notes, tables, etc.) and blackboard writing

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.